THE simplest possible form of statistical classification is “division” (as the 
logicians term it) “by dichotomy,” i.e. the sorting of the objects or individuals 
observed into one or other of two mutually exclusive classes according as they do 
or do not possess some character or attribute; as one may divide men into sane and 
insane, the members of a species of plants into hairy and glabrous, or the members 
of a race of animals into males and females. The mere fact that we do employ 
such a classification in any case must not of course be held to imply a natural and 
clearly defined boundary between the two classes; e.g. sanity and insanity, hairiness 
and glabrousness, may pass into each other by such fine gradations that judgments 
may differ as to the class in which a given individual should be entered. The 
judgment must however be finally decisive; intermediates not being classed as 
such even when observed. 

The theory of statistics of this kind is of a good deal of importance, not 
merely because they are of a fairly common type—the statistics of hybridisation 
experiments given by the followers of Mendel may be cited as recent examples— 
but because the ideas and conceptions required in such theory form a useful 
introduction to the more complex and less purely logical theory of variables. The 
classical writings on the subject are those of De Morgan*, Boole† and Jevons‡, 
the method and notation of the latter being used in the following Notes, the first 
three sections of which are an abstract of the two memoirs referred to below§. 

* Formal Logic, chap. VIII., “On the Numerically Definite Syllogism,” 1847. 

† Analysis of Logic, 1847. Laws of Thought, 1854. 

‡ “On a General System of Numerically Definite Reasoning,” Memoirs of Manchester Literary and 
Philosophical Society, 1870. Reprinted in Pure Logic and other Minor Works, Macmillan, 1890. 

§ “On the Association of Attributes in Statistics,” Phil. Trans. A, Vol. 194 (1900), p. 257. “On 
the Theory of Consistence of Logical Class Frequencies,” Phil. Trans. A, Vol. 197 (1901), p. 91. 
1. Notation; terminology; relations between the class frequencies; tabulation. 


The notation used is as follows*: 


N    = total number of observations,
(A)  = no. of objects or individuals possessing attribute A,
(α)  = no. of objects or individuals not possessing attribute A,
(AB) = no. of objects or individuals possessing both attributes A and B,
(Aβ) = no. of objects or individuals possessing attribute A but not B,
(αB) = no. of objects or individuals possessing attribute B but not A,
(αβ) = no. of objects or individuals not possessing either attribute A or B,


and so on for as many attributes as are specified. A class specified by n attributes 
in this notation may be termed a class of the nth order. The attributes denoted 
by English capitals may be termed positive attributes, and their contraries, 
denoted by the Greek letters, negative attributes. If two classes are such that 
every attribute in the one is the negative or contrary of the corresponding 
attribute in the other they may be termed contrary classes, and their frequencies 
contrary frequencies; (AB) and (αβ), (ABγ) and (αβC) are for instance pairs of 
contraries. 

If the complete series of frequencies arrived at by noting n attributes is being 
tabulated, frequencies of the same order should be kept together. Those of the 
same order are best arranged by taking separately the set or “aggregate” of 
frequencies, derivable from each positive class by substituting negatives for one or 
more of the positive attributes. Thus the frequencies for the case of three 
attributes may conveniently be tabulated in the order— 

Order 0.  N
Order 1.  (A), (α); (B), (β); (C), (γ)
Order 2.  (AB), (Aβ), (αB), (αβ); (AC), (Aγ), (αC), (αγ); (BC), (Bγ), (βC), (βγ)
Order 3.  (ABC), (αBC), (AβC), (ABγ), (αβC), (αBγ), (Aβγ), (αβγ)          ... (1)

But since all frequencies are used non-exclusively, (A) denoting the frequency 
of objects possessing the attribute A with or without others and so forth, the 
frequency of any class can always be expressed in terms of the frequencies of 
classes of higher order; that is to say we have 

\[
\begin{aligned}
N   &= (A) + (\alpha) = (B) + (\beta) = \text{etc.} \\
    &= (AB) + (A\beta) + (\alpha B) + (\alpha\beta) = \text{etc.} \\
(A) &= (AB) + (A\beta) \\
    &= (ABC) + (AB\gamma) + (A\beta C) + (A\beta\gamma) = \text{etc.}
\end{aligned}
\tag{2}
\]


* I have substituted small Greek letters for Jevons’ italics. Italics are rather troublesome when 
speaking, as one has to spell out a group like AbcDE, “big A, little b, little c, big D, big E.” It 
is simpler to say AβγDE. The Greek becomes more troublesome when many letters are wanted, 
owing to the non-correspondence of the alphabets, but this is not often of consequence. 
Hence it is quite unnecessary to state all the frequencies as under (1); if 
space is of importance no more need be given than the eight frequencies of the 
third order, the ultimate frequencies as they may be termed (i.e. frequencies 
of classes specified by the whole number of attributes noted). The number of 
frequencies in an aggregate of order n is evidently 2^n, so that not more than 
2^n frequencies need be stated in any case where n attributes are observed. 


Any other set of 2^n independent frequencies may however be chosen instead 
of the 2^n ultimate frequencies, the set formed by taking N together with the 
frequencies of all the positive classes offering several advantages. It is not 
difficult to see that any frequency whatever can be expressed in terms of the 
number of observations and the positive-class frequencies by using the relations 
(2). We have for instance 


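\[
\begin{aligned}
(\alpha) &= N - (A) \\
(\alpha B) &= (B) - (AB) \\
(\alpha\beta) &= (\alpha) - (\alpha B) = N - (A) - (B) + (AB) \\
(\alpha\beta\gamma) &= (\alpha\beta) - (\alpha\beta C) = (\alpha\beta) - (\alpha C) + (\alpha BC) \\
 &= N - (A) - (B) - (C) + (AB) + (AC) + (BC) - (ABC)
\end{aligned}
\tag{3}
\]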


and so on. 
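
The reduction illustrated in (3) is an alternating sum over the negatived attributes, and may be sketched in Python as below. The encoding is an assumption made purely for illustration: a class is written as a string in which an upper-case letter marks an attribute possessed and a lower-case letter its contrary, and the frequencies in `positive` are hypothetical figures, not taken from any record.

```python
from itertools import combinations


def class_frequency(spec, positive):
    """Frequency of the class `spec` in terms of positive-class frequencies.

    spec     -- e.g. "aBc": upper case = attribute present, lower case = absent.
    positive -- dict from a sorted string of upper-case letters to the frequency
                of that positive class; the empty string "" stands for N.
    """
    present = [ch for ch in spec if ch.isupper()]
    absent = [ch.upper() for ch in spec if ch.islower()]
    total = 0
    # Alternating sum over every subset of the negatived attributes.
    for k in range(len(absent) + 1):
        for extra in combinations(absent, k):
            key = "".join(sorted(present + list(extra)))
            total += (-1) ** k * positive[key]
    return total


# Hypothetical positive-class frequencies for three attributes.
positive = {"": 360, "A": 100, "B": 140, "C": 160,
            "AB": 30, "AC": 40, "BC": 60, "ABC": 10}

print(class_frequency("a", positive))    # N - (A)
print(class_frequency("aB", positive))   # (B) - (AB)
print(class_frequency("abc", positive))  # last line of (3)
```

With these figures the three calls give 260, 110 and 80 respectively.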


To take a very simple example with two attributes only, consider the statement 
of results of one of Mr Bateson’s experiments on hybridisation with poultry. 
Leghorn-Dorking hybrids when crossed inter se produced offspring of varied forms; 
some having the rose comb and some not, some having the extra toe that 
characterises the Dorking and some not. Using A to denote “rose comb,” B to 
denote “extra toe,” the numerical results may be completely expressed in either of 
the following forms: 


        I                  II
(AB) = 208          N    = 336
(Aβ) =  63          (A)  = 271
(αB) =  54          (B)  = 262
(αβ) =  11          (AB) = 208.


The advantages of the second form of tabulation are obvious; it gives at sight 
the whole number of observations and the numbers of A’s and B’s. The first 
table gives neither without reckoning, yet both are equally complete. 
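
That the two forms are equally complete may be checked by passing from one to the other. The following is a minimal Python sketch; the function names and the lower-case spelling of the contraries are illustrative assumptions.

```python
def ultimate_to_positive(AB, Ab, aB, ab):
    """Form I (ultimate frequencies) -> form II (N and positive classes)."""
    return {"N": AB + Ab + aB + ab, "A": AB + Ab, "B": AB + aB, "AB": AB}


def positive_to_ultimate(N, A, B, AB):
    """Form II -> form I, using the relations (3)."""
    return {"AB": AB, "Ab": A - AB, "aB": B - AB, "ab": N - A - B + AB}


form_II = ultimate_to_positive(208, 63, 54, 11)
print(form_II)   # {'N': 336, 'A': 271, 'B': 262, 'AB': 208}
print(positive_to_ultimate(**form_II))
```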


A rather interesting case arises where the frequencies of contrary classes are 
equal, as may be the case if the character dealt with is really variable and the 
points of division between A’s and a’s are taken at the medians. Such a condition 
implies necessary relations between the class-frequencies of any odd order and the 
frequencies of next lower order, but for the discussion of the case I must refer to 
the first memoir mentioned in my note § on p. 121. 

2. Consistence and Inference. 


Although the positive-class frequencies (including N under that heading) are 
all independent in the sense that no single one can be expressed in terms of the 
others, they are nevertheless subject to certain limiting conditions if they are to 
be self-consistent, i.e. such as might have been observed in one and the same field 
of observation or “universe,” to use the convenient term of the logicians. Consider 
the case of three attributes, for example. It is evident that we must have 


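\[
\begin{aligned}
(AB) &\nless 0 && \text{as } (AB) \text{ must not be negative} \\
     &\nless (A) + (B) - N && \text{as } (\alpha\beta) \text{ must not be negative} \\
     &\ngtr (A) && \text{as } (A\beta) \text{ must not be negative} \\
     &\ngtr (B) && \text{as } (\alpha B) \text{ must not be negative}
\end{aligned}
\tag{4}
\]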


and similar conditions must hold for (AC) and (BC). But these are not the only 
conditions that must hold. The second-order frequencies must not only be such 
as not to imply negative values for the frequencies of other classes of their own 
aggregates, but also must not imply negative values for any of the third-order 
frequencies. Expanding all the third-order frequencies in terms of the frequencies 
of positive classes, and putting the resulting expansion ≮ 0, we have 

\[
\begin{array}{rlcl}
 & & & \text{or the frequency given below will be negative} \\
(ABC) \nless & 0 & [1] & (ABC) \\
\nless & (AB) + (AC) - (A) & [2] & (A\beta\gamma) \\
\nless & (AB) + (BC) - (B) & [3] & (\alpha B\gamma) \\
\nless & (AC) + (BC) - (C) & [4] & (\alpha\beta C) \\
\ngtr & (AB) & [5] & (AB\gamma) \\
\ngtr & (AC) & [6] & (A\beta C) \\
\ngtr & (BC) & [7] & (\alpha BC) \\
\ngtr & (AB) + (AC) + (BC) - (A) - (B) - (C) + N & [8] & (\alpha\beta\gamma)
\end{array}
\tag{5}
\]





But if any one of the minor limits [1]—[4] be greater than any one of the 
major limits [5]—[8] these conditions are impossible of fulfilment. There are 
four minor limits to be compared with four major limits or sixteen comparisons in 
all to be made; but the majority of these, twelve in all, only lead back to 
conditions of the form (4). The four comparisons of expansions due to contrary 
frequencies alone lead to new conditions—viz. 


\[
\begin{aligned}
(AB) + (AC) - (BC) &\ngtr (A) \\
(AB) - (AC) + (BC) &\ngtr (B) \\
-(AB) + (AC) + (BC) &\ngtr (C) \\
(AB) + (AC) + (BC) &\nless (A) + (B) + (C) - N
\end{aligned}
\tag{6}
\]
These conditions give limits to any one of the three frequencies (AB), (AC) and 
(BC) in terms of the other two and the frequencies of the first order, i.e. enable us 
to infer limits to the one class-frequency in terms of the others. It will very 
usually happen in practical statistical cases that the limits so obtained are valueless, 
lying outside those given by the simpler conditions (4), but that is merely 
because in practice the values of the assigned frequencies, e.g. (AB) and (AC), 
seldom approach sufficiently closely to their limiting values to render inference 
possible. 
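
The inference of limits may be sketched in Python as follows. The sketch solves each line of (6) for (BC) and combines the result with the simpler limits of form (4); the function names and the numerical figures are hypothetical, chosen only to shew one case where inference is possible and one where it is not.

```python
def limits_second_order(N, A, B):
    """Limits of (AB) from conditions (4): returns (minor, major)."""
    return max(0, A + B - N), min(A, B)


def limits_BC(N, A, B, C, AB, AC):
    """Limits of (BC) given (AB) and (AC), combining (4) with (6)."""
    lower4, upper4 = limits_second_order(N, B, C)
    lower6 = max(AB + AC - A,               # first line of (6)
                 A + B + C - N - AB - AC)   # fourth line of (6)
    upper6 = min(B - AB + AC,               # second line of (6)
                 C + AB - AC)               # third line of (6)
    return max(lower4, lower6), min(upper4, upper6)


# (AB) and (AC) close to their major limit (A) = 50: inference is possible.
print(limits_BC(100, 50, 50, 50, 45, 45))   # (40, 50)
# (AB) and (AC) far from their limits: nothing beyond (4) is learnt.
print(limits_BC(100, 50, 50, 50, 25, 25))   # (0, 50)
```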


3. Association. 


Two attributes, A and B, are usually defined to be independent, within any 
given field of observation or “universe,” when the chance of finding them together 
is the product of the chances of finding either of them separately. The physical 
meaning of the definition seems rather clearer in a different form of statement, 
viz. if we define A and B to be independent when the proportion of A’s amongst 
the B’s of the given universe is the same as in that universe at large. If for 
instance the question were put “What is the test for independence of small-pox 
attack and vaccination?”, the natural reply would be “The percentage of 
vaccinated amongst the attacked should be the same as in the general population” 
or “The percentage of attacked amongst the vaccinated should be the 
same as in the general population.” The two definitions are of course identical 
in effect, and permit of the same simple symbolical expression in our notation; 
the criterion of independence of A and B is in fact 

\[ (AB) = \frac{(A)(B)}{N}. \tag{7} \]


In this equation the attributes specifying the universe are understood, not 
expressed. If all objects or individuals in the universe are to possess an attribute 
or series of attributes K it may be written 

\[ (ABK) = \frac{(AK)(BK)}{(K)}. \]

An equation of such form must be recognised as the criterion of independence 
for A and B within the universe K. As I have shewn in the first memoir referred 
to in note §, p. 121, if the relation (7) hold good, the three similar relations for the 
remaining frequencies of the “aggregate” (i.e. the set of frequencies obtained by 
substituting their contraries α, β for A or B or both) must also hold, viz. 

\[ (\alpha B) = \frac{(\alpha)(B)}{N}, \qquad (A\beta) = \frac{(A)(\beta)}{N}, \qquad (\alpha\beta) = \frac{(\alpha)(\beta)}{N}, \tag{8} \]
for we have at once 

\[ (\alpha B) = (B) - (AB) = \frac{(B)\,[N - (A)]}{N} = \frac{(\alpha)(B)}{N}, \]


and so on. The case of two attributes is thus quite simple; the definition of 
independence is almost intuitive, and the criterion need only be applied to one 
frequency of the aggregate. The two attributes are termed positively or negatively 
associated according as (AB) is greater or less than the value it would have 
in the case of independence, or, to put the same thing another way, according as 


(AB)/(A) is greater or less than (B)/N or (AB)/(B) greater or less than (A)/N. 
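
The comparison may be sketched in Python as follows, applied to the poultry figures of form II above. The function name is an illustrative assumption, and the sketch reports only the sign of the difference between (AB) and (A)(B)/N; it makes no statement as to whether so small a difference is of any significance.

```python
from fractions import Fraction


def association(N, A, B, AB):
    """Sign of the association of A with B: compare (AB) with (A)(B)/N."""
    independent_value = Fraction(A * B, N)
    if AB > independent_value:
        return "positive"
    if AB < independent_value:
        return "negative"
    return "independent"


# Form II of the poultry record: N = 336, (A) = 271, (B) = 262, (AB) = 208.
print(association(336, 271, 262, 208))   # negative
print(float(Fraction(271 * 262, 336)))   # value of (AB) under independence
```

Exact fractions are used so that the case of equality is not disturbed by rounding.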


If more than two attributes are noted in the record they must be dealt with, 
in the first instance, pair by pair as above, but subsequently the association 
between each pair should be observed in the sections of material or sub-universes 
defined by the other attributes. In the case of three attributes, for instance, we 
have not only to deal with the association between A and B in the universe at 
large but also in the universe of C’s and in the universe of γ’s, associations which 
will be tested by comparing (ABC)/(AC) with (BC)/(C) or (ABC)/(BC) with 
(AC)/(C), and (ABγ)/(Aγ) with (Bγ)/(γ) and so on. Such “partial” associations 
are of great practical importance, as a test for the correctness or otherwise of 
physical interpretations placed on any “total” associations observed. When 
A and B are found to be associated it is a common form of argument to say that 
the association is not “direct” but due to the association of A with C and B 
with C; the argument may be tested at once by finding whether A and B are 
still associated in the separated universes C and γ. If they are the argument is 
baseless. It has been said for instance that the association between vaccination 
and protection from small-pox is due to the association of both with sanitary 
conditions (a larger proportion of the upper classes than of the lower classes being 
vaccinated, according to the argument). To test the argument the “universe” or 
material observed should be limited either to those living wholly in sanitary 
conditions or to those living wholly in insanitary conditions (it does not matter 
which). “Partial” associations again are of importance to the biologist in the 
theory of heredity. If an attribute be heritable, its presence in the parent or the 
grandparent is associated with its presence in the offspring; but the physical 
interpretation to be placed on such inheritance depends very largely on whether 
there is also a partial inheritance from the grandparent, the presence of the 
attribute in the grandparent being associated with its presence in the offspring 
even when the parents either all possess or all do not possess the attribute. 
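
A minimal Python sketch of the test of partial association is given below. It computes, within the universe of C’s and within the universe of γ’s separately, the excess of the observed frequency of A with B over the value it would have under independence. The class names (lower case standing for the contrary attribute) and the eight ultimate frequencies are hypothetical.

```python
from fractions import Fraction


def excess(N, A, B, AB):
    """(AB) minus the value (A)(B)/N it would have under independence."""
    return AB - Fraction(A * B, N)


def partial_excesses(u):
    """Excess of A with B inside the universe of C's and of gamma's.

    u maps class names such as "ABc" (lower case = attribute absent)
    to the eight ultimate frequencies.
    """
    result = {}
    for label, c in (("C", "C"), ("gamma", "c")):
        AB, Ab, aB, ab = u["AB" + c], u["Ab" + c], u["aB" + c], u["ab" + c]
        N = AB + Ab + aB + ab          # size of the sub-universe
        result[label] = excess(N, AB + Ab, AB + aB, AB)
    return result


# Hypothetical record in which A and B remain associated in both sub-universes.
u = {"ABC": 30, "AbC": 10, "aBC": 10, "abC": 30,
     "ABc": 20, "Abc": 5, "aBc": 5, "abc": 20}
print(partial_excesses(u))
```

In this hypothetical record both excesses are positive (10 and 7½), so an argument that the total association is due solely to C would fail.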


It is important to notice that the test for association is necessarily based on a 
comparison of percentages or proportions, e.g. (AB)/(A) and (B)/N. The mere 
fact that a certain number of A’s are B gives no physical information; besides 
knowing how many A’s are B you must know also how many non-A’s are B 
or what proportion of A’s exists in the given universe at large. In an investigation 
as to the causation of A it is therefore just as important to observe non-A’s as A’s. 
This point is frequently forgotten. In an investigation as to the inheritance 
of deaf-mutism in America*, for instance, only the offspring of deaf-mutes were 
observed, and the argument consequently breaks down on page after page into 
conjectural statements as to points on which the editor has no information— 
e.g. the proportion of deaf-mutes amongst the children of normals. 


The differences of (AB)/(A) from (B)/N and of (AB)/(B) from (A)/N are of 
course not, as a rule, the same, and it would be useful and convenient to measure 
the “association” by some more symmetrical method, a “coefficient of association” 
ranging between ±1 like the coefficient of correlation. In the first memoir 
referred to in note §, p. 121, such a coefficient, of empirical form, was suggested, 
but that portion of the memoir should now be read in connection with a later 
memoir by Professor Pearson†. 


4. On the theory of complete independence of a series of Attributes. 


The tests for independence are by no means simple when the number of 
attributes is more than two. Under what circumstances should we say that a 
series of attributes ABCD... were completely independent? I believe not a few 
statisticians would reply at once “if the chance of finding them together were 
equal to the product of the chances of finding them separately,” yet such a reply 
would be in error. The mere result 


\[ \frac{(ABCD\ldots)}{N} = \frac{(A)}{N} \cdot \frac{(B)}{N} \cdot \frac{(C)}{N} \cdot \frac{(D)}{N} \cdots \tag{9} \]


does not in general give any information as to the independence or otherwise 
of the attributes concerned. If the attributes are known to be completely inde- 
pendent then certainly the relation (9) holds good, but the converse is not true. 
“Equations of independence” of the form (9) must be shewn to hold for more 
than one class of any aggregate, of an order higher than the second, before the 
complete independence of the attributes can be inferred. 
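
That the converse fails may be seen from a small constructed case. The Python sketch below uses eight hypothetical ultimate frequencies (lower case standing for the contrary attribute) for which the relation (9) holds for the class ABC, and yet A and B are not independent in the universe at large.

```python
from fractions import Fraction

# Hypothetical ultimate frequencies for three attributes.
u = {"ABC": 1, "ABc": 2, "AbC": 1, "Abc": 0,
     "aBC": 0, "aBc": 1, "abC": 2, "abc": 1}


def total(u, letters):
    """Frequency of the positive class whose attributes are `letters`."""
    return sum(f for name, f in u.items() if all(ch in name for ch in letters))


N = total(u, "")
A, B, C = total(u, "A"), total(u, "B"), total(u, "C")
AB, ABC = total(u, "AB"), total(u, "ABC")

# Relation (9) for the single class ABC.
holds_9 = Fraction(ABC, N) == Fraction(A, N) * Fraction(B, N) * Fraction(C, N)
# Criterion (7) for the pair A, B in the universe at large.
pair_independent = AB * N == A * B

print(holds_9, pair_independent)   # True False
```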


From the physical point of view complete independence can only be said to 
subsist for a series of attributes ABCD... within a given universe, when every 
pair of such attributes exhibits independence not only within the universe at large 
but also in every sub-universe specified by one or more of the remaining attributes 
of the series, or their contraries. Thus three attributes A, B, C are completely 
independent within a given universe if AB, AC and BC are independent within 
that universe and also 


AB independent within the universes C and γ,
AC independent within the universes B and β,
BC independent within the universes A and α.


* Marriages of the Deaf in America, ed. by E. A. Fay. Volta Bureau, Washington, 1898. 
† Phil. Trans. Vol. 195, p. 16. 
If a series of attributes are completely independent according to this definition 
relations of the form (9) must hold for the frequency of every class of every 
possible order. Take the class-frequency (ABCD) of the fourth order for instance. 
A and B are, by the terms of the definition, independent within the universe CD. 
Therefore 

\[ (ABCD) = \frac{(ACD)(BCD)}{(CD)}. \]

But A and C, and also B and C, are independent within the universe D. Therefore 
the fraction on the right is equal to 

\[ \frac{1}{(CD)} \cdot \frac{(AD)(CD)}{(D)} \cdot \frac{(BD)(CD)}{(D)} = \frac{(AD)(BD)(CD)}{(D)^2}. \]

But again AD, BD, CD are each independent within the universe at large; 
therefore finally 

\[ (ABCD) = \frac{1}{(D)^2} \cdot \frac{(A)(D)}{N} \cdot \frac{(B)(D)}{N} \cdot \frac{(C)(D)}{N} = \frac{(A)(B)(C)(D)}{N^3}. \]

Any other frequency can be reduced step by step in precisely the same way. 








Now consider the converse problem. The total frequency N is given and also 
the n frequencies (A), (B), (C), etc. In how many of the ultimate frequencies 
(ABCD...MN), (αBCD...MN), etc. must “relations of independence” of the form 

\[ (ABCD \ldots MN) = \frac{(A)(B)(C)(D) \cdots (M)(N)}{N^{n-1}} \]

hold good, in order that complete independence of the attributes may be inferred? 
The answer is suggested at once by the following consideration. The number of 
ultimate frequencies (frequencies of order n) is 2^n; the number of frequencies 
given is n+1. If then all but n+1 of the ultimate frequencies are given in 
terms of the equations of independence, the remaining frequencies are determinate; 
either these determinate values must be those that would be given 
by equations of independence, or a state of complete independence must be impossible. 
Suppose all the ultimate class-frequencies to have been tested and found to be 
given by the equations of independence, with the exception of the negative class 
(αβγδ...μν) and the n classes with one positive attribute (Aβγδ...μν), (αBγδ...μν), 
etc. Take any one of these untested class-frequencies, (Aβγδ...μν), and we have 
for example 

\[
\begin{aligned}
(A\beta\gamma\delta \ldots \mu\nu) = (A) &- (ABCD \ldots MN) \\
 &- (ABCD \ldots M\nu) \\
 &- \text{other terms with one negative} \\
 &- (ABCD \ldots \mu\nu) \\
 &- \text{other terms with two negatives} \\
 &\ \ \vdots \\
 &- \text{other terms with } n-2 \text{ negatives.}
\end{aligned}
\]
But all the frequencies on the right are given by the relations of independence. 
Therefore 

\[
\begin{aligned}
(A\beta\gamma\delta \ldots \mu\nu) = \frac{(A)}{N^{n-1}} \Big\{ N^{n-1} &- (B)(C)(D) \cdots (M)(N) \\
 &- (B)(C)(D) \cdots (M)(\nu) \\
 &- \cdots \\
 &- (B)(\gamma)(\delta) \cdots (\mu)(\nu) \\
 &- \cdots \Big\}.
\end{aligned}
\]

Now consider the terms on the right in the bracket. With the exception of 
the one term (B)(γ)(δ)...(μ)(ν), the remainder can be grouped in pairs of which 
the one member contains (B) and the other (β), the following frequencies in each 
member of the pair being the same. Carrying out this rearrangement the 
expression will read 

\[
\begin{aligned}
(A\beta\gamma\delta \ldots \mu\nu) = \frac{(A)}{N^{n-1}} \Big\{ N^{n-1} &- (B)(\gamma)(\delta) \cdots (\mu)(\nu) \\
 &- (B)(C)(D) \cdots (M)(N) \\
 &- (\beta)(C)(D) \cdots (M)(N) \\
 &- (B)(C)(D) \cdots (M)(\nu) \\
 &- (\beta)(C)(D) \cdots (M)(\nu) \\
 &- (B)(C)(\delta) \cdots (\mu)(\nu) \\
 &- (\beta)(C)(\delta) \cdots (\mu)(\nu) \\
 &- \cdots \Big\}.
\end{aligned}
\]


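Replace (B) by N − (β) throughout and rearrange the terms in similar pairs 
containing C and γ. (B) and (β) are then eliminated from all the terms and 
the expression then becomes 

\[
\begin{aligned}
(A\beta\gamma\delta \ldots \mu\nu) = \frac{(A)}{N^{n-1}} \Big\{ N^{n-1} &+ (\beta)(\gamma)(\delta) \cdots (\mu)(\nu) \\
 &- N(\gamma)(\delta) \cdots (\mu)(\nu) \\
 &- N(C)(\delta) \cdots (\mu)(\nu) \\
 &- N(C)(D) \cdots (M)(N) \\
 &- N(\gamma)(D) \cdots (M)(N) \\
 &- N(C)(D) \cdots (M)(\nu) \\
 &- N(\gamma)(D) \cdots (M)(\nu) \\
 &- \cdots \Big\}.
\end{aligned}
\]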


Biometrika 1 
Replacing (C) by N − (γ) and regrouping in similar pairs of terms containing (D) 
and (δ) this will become 

\[
\begin{aligned}
(A\beta\gamma\delta \ldots \mu\nu) = \frac{(A)}{N^{n-1}} \Big\{ N^{n-1} &+ (\beta)(\gamma)(\delta) \cdots (\mu)(\nu) \\
 &- N^2 (D)(E) \cdots (M)(N) \\
 &- N^2 (\delta)(E) \cdots (M)(N) \\
 &- \text{etc.} \Big\}
\end{aligned}
\]

and continuing the same process until all the frequencies (D)(E)...(M)(N) are 
eliminated, i.e. n−1 times altogether, 

\[ (A\beta\gamma\delta \ldots \mu\nu) = \frac{(A)(\beta)(\gamma)(\delta) \cdots (\mu)(\nu)}{N^{n-1}}. \]





That is to say the theorem must be true quite generally: “A series of attributes 
ABC...MN are completely independent if the relations of independence are 
proved to hold for 2^n − (n+1) of the 2^n ultimate frequencies; such relations 
must then hold for the remaining n+1 frequencies also.” If the ultimate 
frequencies are only given by the relations of independence in n cases or less, 
independence may exist for certain pairs of attributes in certain universes but 
not in general. The mere fact of the relation holding for one class, e.g. 

\[ (ABCD \ldots MN) = \frac{(A)(B)(C) \cdots (M)(N)}{N^{n-1}}, \]

implies nothing, in striking contrast to the simple case of two attributes, where 
2^n − (n+1) = 1 and only the one class-frequency need be tested in order to see if 
independence exists. In the case of three attributes the number of third-order 
classes is eight, of which four must be tested in order to be certain that complete 
independence exists. In the case of four attributes there are sixteen fourth-order 
classes of which eleven must be tested, and so on. 


I have dealt with the problem hitherto on the assumption that only the first-order 
and the nth order frequencies were given, and that the frequencies of 
intermediate orders were unknown, or at least uncalculated, for of course the 
frequencies of all lower orders may be expressed in terms of those of the nth 
order. If however the frequencies of all orders may be supposed known, the above 
result may be thrown into a somewhat interesting form. It will be remembered 
that the frequency of any class of any order may be expressed in terms of the 
frequencies of the positive classes [(A)(AB)(AC)(ABC) etc.] of its own and 
lower orders. Then complete independence exists for a series of attributes if the 
criterion of independence hold for all the positive-class frequencies up to that of the 
nth order. If we have for instance 

\[ (ABCD \ldots MN) = \frac{1}{N^{n-1}} \{(A)(B)(C)(D) \cdots (M)(N)\}, \]

and also 

\[ (BCD \ldots MN) = \frac{1}{N^{n-2}} \{(B)(C)(D) \cdots (M)(N)\}, \]

we must have 

\[
\begin{aligned}
(\alpha BCD \ldots MN) &= (BCD \ldots MN) - (ABCD \ldots MN) \\
 &= \frac{1}{N^{n-1}} (B)(C)(D) \cdots (M)(N)\,[N - (A)] \\
 &= \frac{1}{N^{n-1}} (\alpha)(B)(C)(D) \cdots (M)(N),
\end{aligned}
\]

and so on. The number of class-frequencies to be tested in order to demonstrate 
the existence of complete independence is, of course, the same as before, viz. 
2^n − (n+1). 





It should be noted as a consequence of these results that the definition of 
“complete independence” given on p. 127 is redundant in its terms. It is quite 
true that if complete independence subsist for a series of attributes every possible 
pair must exhibit independence in every possible sub-universe as well as in the 
universe at large, but it is not necessary to apply the criterion of independence to 
all these possible cases. In the case of three attributes for instance the criterion 
of independence need only be applied to four frequencies, as we have just seen, in 
order to demonstrate complete independence; it cannot then be necessary, as 
suggested by the definition, to test nine different associations, viz. 


|AB|    |AB|C|    |AB|γ|,
|AC|    |AC|B|    |AC|β|,
|BC|    |BC|A|    |BC|α|,


in the notation of my memoir on Association (an expression like |AB|C| 
specifying “the association between A and B in the universe of C’s”). It is 
in fact only necessary to test |AB|, |AC|, |BC|, and |AB|C| (or one of the 
other three partial associations in positive universes). If these are zero, the 
remaining associations must be zero also; for we are given 


\[
\begin{aligned}
(ABC) &= \frac{1}{(C)} (AC)(BC) = \frac{1}{N^2} (A)(B)(C), \\
\therefore\ (ABC) &= \frac{1}{(B)} (AB)(BC) = \frac{1}{(A)} (AB)(AC) = \text{etc.}
\end{aligned}
\]

i.e. |AC|B|, |BC|A|, etc. are zero. Quite generally, it is only necessary, if the 
testing be supposed to proceed from the second order classes upwards, to test one 
of all the possible partial associations corresponding to each positive class. If 
there be four attributes ABCD, the six total associations |AB|, |AC|, |AD|, |BC| 
etc. must first be tried; if they are zero, then follow on with |AB|C|, |AB|D|, 
|AC|D| and |BC|D|, but not |AC|B| or |AD|B| etc.; if they are zero then 
finally try |AB|CD|; if it also be zero then the attributes are completely 
independent. It is not necessary to try further |AC|BD| or |AD|BC| etc. 
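
The order of testing just described may be generated mechanically. In the Python sketch below one association is listed for each positive class of the second and higher orders; the choice of the first two letters of each class as the associated pair is merely a convention adopted for the sketch, since any one of the possible partial associations would serve.

```python
from itertools import combinations


def associations_to_test(attributes):
    """One association per positive class of order two and upwards."""
    tests = []
    for order in range(2, len(attributes) + 1):
        for cls in combinations(attributes, order):
            pair, universe = cls[:2], cls[2:]
            label = "|" + "".join(pair) + "|"
            if universe:                 # partial association in a sub-universe
                label += "".join(universe) + "|"
            tests.append(label)
    return tests


print(associations_to_test("ABC"))
# ['|AB|', '|AC|', '|BC|', '|AB|C|']
print(associations_to_test("ABCD"))
```

For four attributes the list runs to eleven entries, ending with |AB|CD|, in agreement with the number 2^n − (n+1).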

The inadequacy of the usual treatment of independence arises from the fact 
that it proceeds wholly à priori, and generally has reference solely to cases of 
artificial chance. The result is an illusory appearance of simplicity. It is pointed 
out that if one “event” can “succeed” in a₁ and “fail” in b₁ ways, a second 
succeed in a₂ and fail in b₂ ways, and so on, the combined events can take place 
(succeed or fail) in 

\[ (a_1 + b_1)(a_2 + b_2) \cdots (a_n + b_n) \]

ways and succeed in 

\[ a_1 a_2 \cdots a_n \]

ways. The chance of entire “success” is therefore 

\[ \frac{a_1 a_2 \cdots a_n}{(a_1 + b_1)(a_2 + b_2) \cdots (a_n + b_n)}; \]

the chance of the first event failing and the rest succeeding is 

\[ \frac{b_1 a_2 \cdots a_n}{(a_1 + b_1)(a_2 + b_2) \cdots (a_n + b_n)}; \]




and so on for all other possible cases. In short the chance of occurrence of the 
combined independent events is the product of the chances of the separate 
events. There the treatment stops, and all practical difficulties are avoided. In 
such text-book treatment it is given that the events are independent and required 
to deduce the consequences; in the problems that the statistician has to handle 
the consequences—the bare facts—are given and it is required to find whether 
the “events” or attributes are independent, wholly or in part. 




5. On the fallacies that may be caused by the mixing of distinct records. 


It follows from the preceding work that we cannot infer independence of a 
pair of attributes within a sub-universe from the fact of independence within the 
universe at large. From |AB| = 0, we cannot infer |AB|C| = 0 or |AB|γ| = 0, 
although we can of course make the corresponding inference in the case of 
complete association, i.e. from |AB| = 1 we do infer |AB|C| = |AB|γ| = etc. = 1. 
But the converse theorem is also true; a pair of attributes does not necessarily 
exhibit independence within the universe at large even if it exhibit independence 
in every sub-universe; given 


|AB|C| = 0,   |AB|γ| = 0, 


we cannot infer |AB| = 0. The theorem is of considerable practical importance 
from its inverse application; i.e. even if |AB| have a sensible positive or 
negative value we cannot be sure that nevertheless |AB|C| and |AB|γ| are not 
both zero. Some given attribute might, for instance, be inherited neither in the 
male line nor the female line; yet a mixed record might exhibit a considerable 
apparent inheritance. Suppose for instance that 50% of the fathers and of the 
sons exhibit the attribute, but only 10% of the mothers and daughters. Then 
if there be no inheritance in either line of descent the record must give 
(approximately) 


fathers with attribute and sons with attribute              25%
fathers with attribute and sons without attribute           25%
fathers without attribute and sons with attribute           25%
fathers without attribute and sons without attribute        25%

mothers with attribute and daughters with attribute          1%
mothers with attribute and daughters without attribute       9%
mothers without attribute and daughters with attribute       9%
mothers without attribute and daughters without attribute   81%


If these two records be mixed in equal proportions we get 


parents with attribute and offspring with attribute         13%
parents with attribute and offspring without attribute      17%
parents without attribute and offspring with attribute      17%
parents without attribute and offspring without attribute   53%


Here 13/30 = 43⅓% of the offspring of parents with the attribute possess the 
attribute themselves, but only 30% of offspring in general, i.e. there is quite 
a large but illusory inheritance created simply by the mixture of the two distinct 
records. A similar illusory association, that is to say an association to which the 
most obvious physical meaning must not be assigned, may very probably occur in 
any other case in which different records are pooled together or in which only 
one record is made of a lot of heterogeneous material. 
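
The arithmetic of the example may be reproduced by the following Python sketch. Each record is built on the supposition of no inheritance, so that every cell is the product of the two proportions, and the two records are then pooled in equal proportions. The function names are illustrative assumptions.

```python
from fractions import Fraction


def unassociated_record(p_parent, p_offspring):
    """Fourfold table (as proportions) for a record with no inheritance."""
    return {
        (True, True): p_parent * p_offspring,
        (True, False): p_parent * (1 - p_offspring),
        (False, True): (1 - p_parent) * p_offspring,
        (False, False): (1 - p_parent) * (1 - p_offspring),
    }


def pool(records):
    """Mix several records in equal proportions."""
    return {cell: sum(r[cell] for r in records) / len(records)
            for cell in records[0]}


male = unassociated_record(Fraction(1, 2), Fraction(1, 2))
female = unassociated_record(Fraction(1, 10), Fraction(1, 10))
mixed = pool([male, female])

parents_with = mixed[(True, True)] + mixed[(True, False)]
offspring_with = mixed[(True, True)] + mixed[(False, True)]
print(mixed[(True, True)] / parents_with)   # 13/30
print(offspring_with)                       # 3/10
```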


Consider the case quite generally. Given that |AB|C| and |AB|γ| are both 
zero, find the value of (AB). From the data we have at once 

\[
\begin{aligned}
(AB\gamma) &= \frac{(A\gamma)(B\gamma)}{(\gamma)} = \frac{[(A) - (AC)]\,[(B) - (BC)]}{[N - (C)]}, \\
(ABC) &= \frac{(AC)(BC)}{(C)}.
\end{aligned}
\]

Adding, 

\[ (AB) = \frac{N(AC)(BC) - (A)(C)(BC) - (B)(C)(AC) + (A)(B)(C)}{(C)\,[N - (C)]}. \]

Write 

\[ (AB)_0 = \frac{1}{N}(A)(B), \qquad (AC)_0 = \frac{1}{N}(A)(C), \qquad (BC)_0 = \frac{1}{N}(B)(C), \]

subtract (AB)₀ from both sides of the above equation, simplify, and we have 

\[ (AB) - (AB)_0 = \frac{N\,[(AC) - (AC)_0]\,[(BC) - (BC)_0]}{(C)\,[N - (C)]}. \]

That is to say, there will be apparent association between A and B in the universe 
at large unless either A or B is independent of C. Thus, in the imaginary case 
of inheritance given above, if A and B stand for the presence of the attribute 
in the parents and the offspring respectively, and C for the male sex, we find a 
positive association between A and B in the universe at large (the pooled results) 
because A and B are both positively associated with C, i.e. the males of both 
generations possess the attribute more frequently than the females. The “parents 
with attribute” are mostly males; as we have only noted offspring of the same 
sex as the parents, their offspring must be mostly males in the same proportion, 
and therefore more liable to the attribute than the mostly-female offspring of 
“parents without attribute.” It follows obviously that if we had found no 
inheritance to exist in any one of the four possible lines of descent (male-male, 
male-female, female-male, and female-female), no fictitious inheritance could have 
been introduced by the pooling of the four records. The pooling of the two 
records for the crossed-sex lines would give rise to a fictitious negative inherit- 
ance—disinheritance—cancelling the positive inheritance created by the pooling 
of the records for the same-sex lines. I leave it to the reader to verify these 
statements by following out the arithmetical example just given should he so 
desire. 
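
The verification may be sketched in Python as follows. The first function evaluates the general expression for the excess of (AB) over its independence value, on the supposition that A and B are independent within both C and γ; the second pools any number of lines of descent. It is assumed here, for illustration, that the records are pooled in equal proportions and that in the crossed-sex lines the proportions remain 50% for males and 10% for females.

```python
from fractions import Fraction


def fictitious_excess(N, A, B, C, AC, BC):
    """(AB) - (AB)_0 when A and B are independent within both C and gamma."""
    AC0 = Fraction(A * C, N)
    BC0 = Fraction(B * C, N)
    return N * (AC - AC0) * (BC - BC0) / (C * (N - C))


def pooled_excess(lines):
    """Excess of (AB) over (A)(B)/N, as a proportion, for pooled records.

    Each line is (p_parent, p_offspring), with no inheritance inside the line.
    """
    k = len(lines)
    both = sum(p * q for p, q in lines) / k
    parents = sum(p for p, _ in lines) / k
    offspring = sum(q for _, q in lines) / k
    return both - parents * offspring


hi, lo = Fraction(1, 2), Fraction(1, 10)
same_sex = [(hi, hi), (lo, lo)]    # male-male, female-female
crossed = [(hi, lo), (lo, hi)]     # male-female, female-male

# Pooled same-sex record in percentages: N = 100, (A) = (B) = 30, (C) = 50.
print(fictitious_excess(100, 30, 30, 50, 25, 25))   # 4, i.e. 13 - 9
print(pooled_excess(same_sex))                      # 1/25
print(pooled_excess(crossed))                       # -1/25
print(pooled_excess(same_sex + crossed))            # 0
```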

The fallacy might lead to seriously misleading results in several cases where 
mixtures of the two sexes occur. Suppose for instance experiments were being 
made with some new antitoxin on patients of both sexes. There would nearly 
always be a difference between the case-rates of mortality for the two. If the 
female cases terminated fatally with the greater frequency and the antitoxin were 
administered most often to the males, a fictitious association between “antitoxin” 
and “cure” would be created at once. The general expression for (AB) − (AB)₀ 
shews how it may be avoided; it is only necessary to administer the antitoxin 
to the same proportion of patients of both sexes. This should be kept constantly in 
mind as an essential rule in such experiments if it is desired to make the most use 
of the results. 
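
The antitoxin case can be followed through with a small numerical sketch. All the figures below are invented for illustration (a cure rate of 4/5 among males and 1/2 among females, 100 patients of each sex), and the function name is hypothetical; the antitoxin is assumed to have no effect whatever within either sex.

```python
from fractions import Fraction

def pooled_cure_rates(strata):
    """Return (treated_rate, untreated_rate) after pooling the strata.

    Each stratum is (n_treated, n_untreated, cure_rate); within a stratum
    the cure rate is the same whether or not the antitoxin was given.
    """
    treated = sum(n_t for n_t, _, _ in strata)
    untreated = sum(n_u for _, n_u, _ in strata)
    cured_treated = sum(n_t * rate for n_t, _, rate in strata)
    cured_untreated = sum(n_u * rate for _, n_u, rate in strata)
    return cured_treated / treated, cured_untreated / untreated

# Invented cure rates: males 4/5, females 1/2.
males_rate, females_rate = Fraction(4, 5), Fraction(1, 2)

# Antitoxin given mostly to males: 80 of 100 males, 20 of 100 females.
unequal = [(80, 20, males_rate), (20, 80, females_rate)]
# Antitoxin given to the same proportion (one half) of each sex.
equal = [(50, 50, males_rate), (50, 50, females_rate)]

unequal_rates = pooled_cure_rates(unequal)
equal_rates = pooled_cure_rates(equal)
print("unequal allotment:", unequal_rates)
print("equal allotment:  ", equal_rates)
```

With the unequal allotment the pooled record shows a higher cure rate among the treated although the antitoxin does nothing in either sex; with equal proportions the two pooled rates coincide.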

The fictitious association caused by mixing records finds its counterpart in the 
spurious correlation to which the same process may give rise in the case of 
continuous variables, a case to which attention was drawn and which was fully 
discussed by Professor Pearson in a recent memoir*. If two separate records, for 
each of which the correlation is zero, be pooled together, a spurious correlation
will necessarily be created unless the mean of one of the variables, at least, be the
same in the two cases.


* Phil. Trans. A, Vol. 192, p. 277. 
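
The statement about pooled records can be checked on a toy case. The sketch below uses invented points (four per record) and an ordinary product-moment coefficient; it is only an illustration of the principle, not a reproduction of the memoir cited.

```python
import math

def pearson(xs, ys):
    """Product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def r_of(points):
    xs, ys = zip(*points)
    return pearson(xs, ys)

# An invented record with zero correlation.
record_a = [(0, 0), (0, 2), (2, 0), (2, 2)]
# A second record of the same shape with BOTH means displaced.
both_means_shifted = [(x + 10, y + 10) for x, y in record_a]
# A second record in which only the mean of x is displaced.
one_mean_shifted = [(x + 10, y) for x, y in record_a]

r_separate = (r_of(record_a), r_of(both_means_shifted), r_of(one_mean_shifted))
r_pooled_both = r_of(record_a + both_means_shifted)
r_pooled_one = r_of(record_a + one_mean_shifted)
print(r_separate, r_pooled_both, r_pooled_one)
```

Each record taken alone gives zero; pooling the two records whose means differ in both variables gives a large spurious coefficient, while pooling records that agree in the mean of one variable leaves it at zero.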











A FURTHER STUDY OF STATISTICS RELATING TO 
VACCINATION AND SMALLPOX. 


By W. R. MACDONELL, LL.D. 


(1) THE following paper is a continuation of the Note published in Biometrika,
Vol. I. No. 3, but I have been able to extend my results to a comparison of the
differential character of two epidemics in the same city, and to some preliminary 
consideration of the influence of occupation on the existence or non-existence 
of vaccination and on the severity of a smallpox attack. My statistical material is 
taken from: A Summary of Statistics relating to Vaccination and Smallpox as 
observed in the Cases admitted to the City of Glasgow Smallpox Hospital, Belvidere, 
between 10th April, 1900, and 30th June, 1901, by Dr R. S. Thomson and 
Dr Fullarton—a paper read before the Royal Philosophical Society of Glasgow on 
2nd April, 1902—and from supplementary statistics relating to the same epidemic 
kindly sent to me by Dr Brownlee, Physician Superintendent of the Belvidere 
Hospital. The tables were prepared, and the coefficients of correlation with their 
probable errors calculated in the same manner as in the earlier paper. I may
repeat that I use “mild” as equal to “discrete,” and “severe” as equal to
“confluent” and “haemorrhagic.” The few cases (only 2½ per cent.) of “doubtful”
vaccination have been excluded from my tables. The following are my tables
for the Glasgow epidemic of 1900-1.











TABLE I.

                  Recoveries   Deaths   Totals
Vaccinated           1493        150     1643
Unvaccinated           59         63      122
Totals               1552        213     1765

r = .6294 ± .0296.

TABLE II.

                     Mild   Severe   Totals
Vaccinated           1361     282     1643
Unvaccinated           53      69      122
Totals               1414     351     1765

r = .5162 ± .0322.

TABLE III.

Scars                Mild   Severe   Totals
Foveated              573      82      655
Unfoveated            788     200      988
Totals               1361     282     1643

r = .1929 ± .0374.

TABLE IV.

Area of Scar                   Mild   Severe   Totals
Over half square inch           805     113      918
Half square inch and under      556     169      725
Totals                         1361     282     1643

r = .2646 ± .0301.

TABLE V.

Number of Scars      Mild   Severe   Totals
Two and upwards       652     105      757
One                   709     177      886
Totals               1361     282     1643

r = .1511 ± .0306.

TABLE VI.
Vaccinated Cases.

                  Recoveries   Deaths   Totals
Mild                 1336         25     1361
Severe                157        125      282
Totals               1493        150     1643

r = .8603 ± .0150.

The statistics also enable me to find the coefficient of correlation between age 
and severity of attack. 


TABLE VII.
Vaccinated Cases.

Years                Mild   Severe   Totals
0-20                  244      14      258
20 and over          1117     268     1385
Totals               1361     282     1643

r = .3693 ± .0386.


If the division be made at 25 years instead of 20, r = .3297 ± .0314; if at 35 years,
r = .3180 ± .0293. Calculating this coefficient for the Glasgow epidemic of 1892-95,
I find r = .3366 ± .0715, for a division at 20 years; I am unable to calculate it for
a division at 25 or 35 years, as there are no divisions at these periods in the
statistics of the earlier epidemic.


(2) For the sake of comparison, the coefficients for the two Glasgow epidemics 
are collected in the table below. 


TABLE VIII.

Coefficient of correlation between                    Epidemic of      Epidemic of
                                                        1900-1           1892-95
Vaccination and strength of resistance              .6294 ± .0296    .7783 ± .0365
Vaccination and severity of attack                  .5162 ± .0322    .9123 ± .0181
Foveation of scar and severity of attack            .1929 ± .0374    .3951 ± .0594
Area of scar and severity of attack                 .2646 ± .0301    .3520 ± .0584
Number of scars and severity of attack              .1511 ± .0306    .2323 ± .0616
Strength of resistance and severity of attack       .8603 ± .0150         -
Age (division at 20 years) and severity of attack   .3693 ± .0386    .3366 ± .0715
Age (division at 25 years) and severity of attack   .3297 ± .0314         -
Age (division at 35 years) and severity of attack   .3180 ± .0293         -


It will be observed that in the recent epidemic the coefficient of correlation 
between vaccination and strength of resistance, while less than in 1892-95, was 
very much the same as in previous epidemics in other towns*. On the other 
hand, the correlation between vaccination and degree of severity of attack is very 
much less than in 1892-95, which points to a marked difference in the character 
of the two epidemics, the earlier being the milder. Also the correlation between 
degree of severity and (1) foveation, (2) area, and (3) number, of scars is less even 
than in 1892-95; but I understand from Dr Brownlee that in concluding whether
a scar is good or bad, clinically, he would take into consideration its area,
the area of foveation, and the amount of depression and “puckering.” The fact
is also clearly brought out that adults are considerably more liable to the severe
forms of the disease than the young.

* Biometrika, Vol. I. No. 3, p. 380.
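
One way to judge how far the two epidemics differ is to express each difference of coefficients in units of its own probable error. The sketch below is not the calculation of the paper: it assumes the two epidemics may be treated as independent samples, so that the probable error of a difference is the root of the sum of squares of the two probable errors, and the dictionary keys and function name are chosen here for illustration. The figures are those of Table VIII as read from this copy.

```python
import math

# (r, probable error) for 1900-1 and for 1892-95, as read from Table VIII.
COEFFICIENTS = {
    "vaccination and strength of resistance": ((0.6294, 0.0296), (0.7783, 0.0365)),
    "vaccination and severity of attack": ((0.5162, 0.0322), (0.9123, 0.0181)),
    "age (division at 20) and severity": ((0.3693, 0.0386), (0.3366, 0.0715)),
}

def difference_in_probable_errors(later, earlier):
    """(earlier r - later r) divided by the probable error of the difference.

    Assumes independent samples, so the probable errors combine in quadrature.
    """
    (r_later, pe_later), (r_earlier, pe_earlier) = later, earlier
    return (r_earlier - r_later) / math.hypot(pe_later, pe_earlier)

ratios = {name: difference_in_probable_errors(*pair)
          for name, pair in COEFFICIENTS.items()}
for name, ratio in ratios.items():
    print(f"{name}: {ratio:+.1f} probable errors")
```

On this rough reckoning the difference for vaccination and severity of attack is more than ten times its probable error, while that for age and severity is well within one.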


(3) The next part of my inquiry presents some features which, I think, 
are novel, and is based on figures relating to the Glasgow epidemic of 1900-1, 
supplied by Dr Brownlee. He grouped the male patients according to their 
occupations, adopting the grouping of the Registrar-General’s Reports with three 
modifications: (1) Professions, Clerks, etc. are all grouped together, as the number 
of patients other than clerks was very small; e.g. only four professional men were 
admitted to Belvidere: (2) Railway men are shown separately from Transport 
Service : (3) Shopkeepers include all kinds, instead of the limited number adopted 
in the Registrar-General’s Reports. Cases of doubtful vaccination (25) are again 
excluded. 


The following table is prepared from Dr Brownlee’s figures : 


TABLE IX.
Smallpox Cases admitted into Belvidere Hospital 1900-1. (Males.)

                                      Vaccinated
Occupations                  Recoveries   Deaths   Totals   Unvaccinated
Miners                           25          3       28           2
Labourers                       167         25      192           9
Metal Workers                   159         14      173           0
Shopkeepers                      70         10       80           5
Railway men                      13          2       15           1
Transport Service                85         11       96           3
Other Trades                     46         10       56           3
Spirit Salesmen                  11          3       14           0
Professions, Clerks, etc.        65          6       71           1
Building Trades                  74          9       83           1
Textiles                         22          1       23           0
Unoccupied                       10          1       11           1
Totals                          747         95      842          29


From the statistics of Table IX. I wanted to ascertain the correlation between 
the social status of the patients and (1) the presence or absence of the scar, 
and (2) in the case of the vaccinated their power to resist the disease. For this
purpose it was necessary to divide my material into two classes having a higher
and a lower status. No doubt considerable diversity of opinion may exist as to
the components which should fall into these two classes. I consulted from this
standpoint the death-rates of various classes as given by the Registrar-General.
My chief difficulty arose from the group of metal workers who form by far the
largest single occupation group after the labourers, and yet exhibit not a single 
unvaccinated person. It is clear that such a class according as it is included 
in the higher or lower status group may completely change the correlation between 
status and vaccination. If the metal workers included filecutters and lead- 
workers, they would have a high death-rate and we ought to put them in the 
group of lower status. On the other hand in some districts the metal workers 
belong to the most trained and the best paid class of craftsmen, men who are 
likely to be well nourished and with fairly healthy homes. I accordingly wrote to 
Dr Brownlee inquiring as to the status of the Glasgow metal workers and as to the 
absence of unvaccinated cases among them. His reply is as follows : 


“No filecutters or leadworkers were included in the metal workers. The latter
were in great majority made up of shipbuilding and forge employees, these
being the main metal industries in Glasgow. A very few brassworkers were
included. The status of these metal workers as we received them was not so high
as that of either the men who work on the Railways or in the Building Trades,
and I think that they should be included in your second class. The absence of
unvaccinated cases is to a certain extent explainable. Most of these workers were
from the neighbouring forges where on account of the contiguity of the hospital
the employers have for a long time exercised a certain amount of supervision of
the vaccination of those employed in the works. Smallpox has several times
invaded these works since 1890, so that I think unvaccinated persons must
be very few indeed.”


It must be at once confessed that this more or less enforced vaccination of the 
large group of metal workers is a very disturbing factor in the consideration 
of the relation between status and vaccination. Adopting Dr Brownlee’s view 
I first put the metal workers in the class of lower status. 


As a first grouping I took for those of higher status: Professions, Clerks, etc., 
and Shopkeepers. I obtained: 


TABLE X.
First Grouping.

Class        Vaccinated   Unvaccinated   Totals
Lower           691            20          711
Higher          151             9          160
Totals          842            29          871

r = .1862 ± .0737.


I now included in the class of higher status, Miners, Railway men and Building 
Trades as well as Professions, Clerks and Shopkeepers. 


TABLE XI.
Second Grouping.

Class        Vaccinated   Unvaccinated   Totals
Lower           565            16          581
Higher          277            13          290
Totals          842            29          871

r = .1362 ± .0695.


There is thus for both cases a quite sensible if not very large correlation 
between status and vaccination,—cases of unvaccinated persons occur more frequently 
in the classes of higher than of lower status. 


Putting on one side Dr Brownlee’s views on the status of the metal workers I
now formed two tables in which the metal workers were included in the group of
higher status, adding them first to Professions, Clerks, etc. and Shopkeepers,
as a third grouping, and finally to all these together with Miners, Railway men
and Building Trades as a fourth grouping. I found:


TABLE XII.
Third Grouping.

Class        Vaccinated   Unvaccinated   Totals
Lower           518            20          538
Higher          324             9          333
Totals          842            29          871

r = [value illegible in this copy].

TABLE XIII.
Fourth Grouping.

Class        Vaccinated   Unvaccinated   Totals
Lower           392            16          408
Higher          450            13          463
Totals          842            29          871

r = −.0939 ± .0691.


The correlation has clearly swung round, and there is now a very slight
correlation, hardly sensible considering the probable error, between higher status
and vaccination. In view, however, of Dr Brownlee’s opinion as to the character
of the metal workers, I do not think this classification is legitimate. Considering
also that vaccination appears to be more or less compulsory among a considerable
section of them I determined to omit them altogether, and then obtained the
following table:


TABLE XIV.
First Grouping, Metal Workers excluded.

Class        Vaccinated   Unvaccinated   Totals
Lower           518            20          538
Higher          151             9          160
Totals          669            29          698

r = .1167 ± .0763.


Comparing this with Tables X. and XII. we see that there is a sensible 
but small correlation between higher status and unvaccinated condition. 


Now I lay no particular stress on these results because the material is far 
too sparse *, but I believe that the above statistics are the only ones hitherto dealt 
with with a view to determining whether the classes of higher status—presumably 
the better fed and healthier classes of the community—are or are not more 
frequently vaccinated than the lower, presumably the less nourished and less healthy 
classes. No dogmatic conclusion can be drawn from these data, but they exhibit 
no evidence at all for the unvaccinated class being of lower status than the 
vaccinated class; on the contrary, there is slight evidence to show that the 
unvaccinated in Glasgow occur rather more frequently in the classes of higher 
status. 
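
The direction of the association in each grouping can be read off without any correlation coefficient, simply by comparing the share of unvaccinated cases in the two classes. The sketch below does this for Tables X. to XIV.; the counts are taken from the tables as read from this copy (the higher-class row of the fourth grouping is inferred from the marginal totals), and the dictionary keys and function name are chosen here for illustration.

```python
from fractions import Fraction

# (vaccinated, unvaccinated) for the lower and higher status classes.
TABLES = {
    "X (first grouping)": {"lower": (691, 20), "higher": (151, 9)},
    "XI (second grouping)": {"lower": (565, 16), "higher": (277, 13)},
    "XII (third grouping)": {"lower": (518, 20), "higher": (324, 9)},
    # Higher row inferred from the totals 842, 29 and 871.
    "XIII (fourth grouping)": {"lower": (392, 16), "higher": (450, 13)},
    "XIV (metal workers excluded)": {"lower": (518, 20), "higher": (151, 9)},
}

def unvaccinated_share(vaccinated, unvaccinated):
    """Fraction of a class that is unvaccinated."""
    return Fraction(unvaccinated, vaccinated + unvaccinated)

shares = {
    name: {cls: unvaccinated_share(*counts) for cls, counts in rows.items()}
    for name, rows in TABLES.items()
}
for name, s in shares.items():
    print(f"{name}: lower {float(s['lower']):.1%}, higher {float(s['higher']):.1%}")
```

The unvaccinated share is larger in the higher class for the first and second groupings and when the metal workers are excluded; it is larger in the lower class only when the metal workers, with no unvaccinated case among them, are placed in the higher class.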





(4) I turn now to the question whether among the vaccinated there exists 
a correlation between status and severity of the disease. I obtained the following 
tables : 


TABLE XV.
First Grouping.

Class        Deaths   Recoveries   Totals
Lower           79        612        691
Higher          16        135        151
Totals          95        747        842

r = .0249 ± .0566.


* The fewness of unvaccinated cases possibly arises from the fact that during the 1892-95 epidemic 
in Glasgow vaccination was performed on a large scale amongst all classes, so that the epidemic 
of 1900-1 found the great majority of the population vaccinated. 
TABLE XVI.
Second Grouping.

Class        Deaths   Recoveries   Totals
Lower           65        500        565
Higher          30        247        277
Totals          95        747        842

r = .0216 ± .0500.

That is to say there is a positive but practically insensible correlation between 
status and power to resist the disease. 
I now included the metal workers in the higher status and found with the 
former groupings : 

TABLE XVII.
Third Grouping.

Class        Deaths   Recoveries   Totals
Lower           65        453        518
Higher          30        294        324
Totals          95        747        842

r = .1086 ± .0490.

TABLE XVIII.
Fourth Grouping.

Class        Deaths   Recoveries   Totals
Lower           51        341        392
Higher          44        406        450
Totals          95        747        842

r = .1052 ± .0478.


Thus the correlation between better status and recovery is slightly increased. 
Leaving the metal workers as before out of consideration, I find: 


TABLE XIX.
First Grouping without Metal Workers.

Class        Deaths   Recoveries   Totals
Lower           65        453        518
Higher          16        135        151
Totals          81        588        669

r = .0579 ± .0594.

This is probably the most satisfactory result as about the mean of the previous
tables. We accordingly conclude that there is a very slight relation between 
status and recovery from attack. 
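
The five tables of this section all follow from the vaccinated columns of Table IX. once the occupations of the higher class are fixed. The sketch below rebuilds them as a check on the arithmetic; the counts are those of Table IX. as read from this copy, "Miners" is an assumed label for its first row (the row is illegible here but is the only occupation named in the groupings that is otherwise missing), and the function and constant names are chosen for illustration.

```python
# (recoveries, deaths) among vaccinated male cases, as read from Table IX.
# "Miners" is an assumed label for the first row.
VACCINATED = {
    "Miners": (25, 3),
    "Labourers": (167, 25),
    "Metal Workers": (159, 14),
    "Shopkeepers": (70, 10),
    "Railway men": (13, 2),
    "Transport Service": (85, 11),
    "Other Trades": (46, 10),
    "Spirit Salesmen": (11, 3),
    "Professions, Clerks, etc.": (65, 6),
    "Building Trades": (74, 9),
    "Textiles": (22, 1),
    "Unoccupied": (10, 1),
}

BASE = {"Professions, Clerks, etc.", "Shopkeepers"}
EXTRA = {"Miners", "Railway men", "Building Trades"}
METAL = {"Metal Workers"}

def split(higher, excluded=frozenset()):
    """Return {"lower": (deaths, recoveries), "higher": (deaths, recoveries)}."""
    totals = {"lower": [0, 0], "higher": [0, 0]}
    for occupation, (recoveries, deaths) in VACCINATED.items():
        if occupation in excluded:
            continue
        cls = "higher" if occupation in higher else "lower"
        totals[cls][0] += deaths
        totals[cls][1] += recoveries
    return {cls: tuple(pair) for cls, pair in totals.items()}

tables = {
    "XV": split(BASE),
    "XVI": split(BASE | EXTRA),
    "XVII": split(BASE | METAL),
    "XVIII": split(BASE | EXTRA | METAL),
    "XIX": split(BASE, excluded=METAL),
}
for name, table in tables.items():
    rates = {cls: round(d / (d + r), 3) for cls, (d, r) in table.items()}
    print(name, table, "death-rates:", rates)
```

The printed death-rates differ between the two classes by only a point or two in every grouping, which agrees with the smallness of the coefficients found above.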


The way in which the metal workers increase this correlation when they are 
placed in the better class is remarkable; they have in Glasgow a comparatively 
small mortality from smallpox, yet we find from the 55th Annual Report for 
England of the Registrar-General—Medical Supplement that the general mortality 
among metal workers is higher at all ages, especially after 35, than that of 
occupied males. The figures are as follows, if the standard rate of mortality 
among occupied males at each age be taken as 100: 


Ages ...           15-    20-    25-    35-    45-    55-    65 and up.
Metal Workers      105    106    103    111    122    129    128


It would be interesting to compare the special classes grouped as metal 
workers in Glasgow, with those embraced under the same heading in the Registrar- 
General’s Reports. 


(5) Conclusions. 


(i) The statistical constants for vaccination and smallpox differ sensibly for 
the same place with two different epidemics, i.e. epidemics seem to be differentiated 
in character. 


(ii) The statistics of Glasgow do not indicate that those of lower status 
—and therefore probably worse nourished and housed—provide the bulk of non- 
vaccinated cases. On the contrary there seems to be a slight tendency for the 
non-vaccinated to be of higher status. 


(iii) There is a slight although scarcely sensible correlation between status
and power to resist a smallpox attack. 


The Glasgow statistics do not go very far, but as far as they go they do not 
justify the statement: That the apparent protection of vaccination is due to the 
unvaccinated belonging to classes of lower status which have a far smaller power 
of resistance to the disease than the better nourished classes of a higher status in 
which the members are more generally vaccinated. 


I have not dealt with the statistics as to female patients as the great bulk of 
them are classified merely as “ Housewives,” which throws no light on their social 
position. 


In concluding this paper I venture to express the hope that the statistics 
of the recent London epidemic may soon be issued and in a form which will 
admit of due consideration of the problems referred to in this paper. Their 
magnitude gives them extreme value and their publication is no doubt anxiously
awaited by a wide circle of scientific inquirers at home and abroad.


NOTE.


From the “Times” of 20 November 1902, it appears that the Medical Officer 
of Health for Islington has prepared his final report on the 1901-2 epidemic so far 
as his own district was concerned, but I have been unable to obtain a copy of it.
The paragraph in the “Times”, however, enables me to form the following table:


TABLE XX.
Smallpox in Islington, 27 August 1901 to 29 August 1902.

                  Recoveries   Deaths   Totals
Vaccinated            207         29      236
Unvaccinated           35         30       65
Totals                242         59      301

r = .5744 ± .0560.


This value of r is in extremely close agreement with that obtained for 1017
cases which were “completed” in London in 1901*, viz. r = .5779 ± .0311. To
show that the closeness of the agreement holds throughout the investigation,
I give the equations from which r was calculated in both cases:

For the above 301 cases:

.070608 r^6 + .002326 r^5 + .151410 r^4 + .017030 r^3 + .336384 r^2 + r = .707743.

For the 1017 cases:

.071607 r^6 + .001780 r^5 + .149636 r^4 + .019844 r^3 + .326092 r^2 + r = .710100.
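
The two equations can be solved numerically to confirm the values of r quoted. The sketch below is only a check on the printed equations, with the coefficients as read from this copy; it uses plain bisection on the interval from 0 to 1, which is a choice made here and not necessarily the method by which the roots were originally found. Since every coefficient is positive the left-hand side increases steadily with r over that interval, so there is a single root.

```python
def solve_for_r(coefficients, rhs, iterations=60):
    """Root in [0, 1] of sum(c_k * r**k) = rhs, found by bisection.

    `coefficients` maps each power k to its coefficient c_k; all c_k are
    positive here, so the left-hand side is increasing on [0, 1].
    """
    def f(r):
        return sum(c * r ** k for k, c in coefficients.items()) - rhs

    low, high = 0.0, 1.0
    for _ in range(iterations):
        mid = (low + high) / 2
        if f(mid) < 0:
            low = mid
        else:
            high = mid
    return (low + high) / 2

# Coefficients as read from the two printed equations.
islington = {6: 0.070608, 5: 0.002326, 4: 0.151410, 3: 0.017030, 2: 0.336384, 1: 1.0}
london = {6: 0.071607, 5: 0.001780, 4: 0.149636, 3: 0.019844, 2: 0.326092, 1: 1.0}

r_islington = solve_for_r(islington, 0.707743)
r_london = solve_for_r(london, 0.710100)
print(f"301 cases:  r = {r_islington:.4f}")
print(f"1017 cases: r = {r_london:.4f}")
```

To four places the roots agree with the values .5744 and .5779 given in the text.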


* Biometrika, Vol. 1. No. 3, p. 379. 

















COOPERATIVE INVESTIGATIONS ON PLANTS. 


II. VARIATION AND CORRELATION IN LESSER
CELANDINE FROM DIVERS LOCALITIES.


(1) In view of the data for F. ranunculoides published by Dr F. Ludwig*
and Prof. MacLeod† and the statistical constants determined for them‡ it seemed
desirable to obtain rather more statistical material and a more comprehensive 
series of constants for the purposes of a comparative study. There were two 
points to be considered, namely: (i) the influence of locality and (ii) the influence 
of the time during the flowering season at which the flowers were gathered. 
Unfortunately we were in the present season in no case able to obtain from our 
collectors two series from the same locality with a month’s interval between the 
gatherings. All the collecting except in one case had to be done during the 
brief Easter vacation of our workers, and this did not admit of a double gathering 
in the same locality at a suitable interval. The one exception is that of the 
Bordighera collection. Mr Francis Galton most kindly offered, as he was wintering 
on the Mediterranean, to provide a double series of Lesser Celandine flowers. 
The first series was gathered about February 19th and at once dispatched to 
England. On arrival it was found that practically every sepal, every petal, and 
nearly every stamen had fallen from the flower-heads. This lost us our earlier 
series, but we learnt a most valuable lesson, namely: that transit of any kind, 
even by hand, will cost the flower if nearly full-blown one or more sepals, petals or 
stamens. Our plans had therefore to be changed; each celandine flower was 
now gathered as a bud and wrapped up in a small piece of tissue paper. This 
involved a great increase in the labour of gathering and a much greater one in 
that of counting, a good deal of which had to be done under a lens, but we were 
thus certain of preserving all the parts of the flower intact. Mr Francis Galton 
suggested and carried out this arrangement in a second series gathered between 
March 4th and 7th which reached England safely, but three or four weeks later 
than this there were no flowering celandines to be obtained in Bordighera. 

Mr Galton’s plan was carried out in the further collections made in Guernsey,
Dorset and Surrey, the collectors gathering the buds, and wrapping them up
in situ in a small piece of tissue paper. The buds dried and were preserved for
weeks, to be counted at leisure. Even if a petal or sepal fell off it was preserved
in the paper wrapper. We believe it is largely due to this method of gathering
that our results show such a totally different distribution of sepals and petals to
those of other observers.

* Biometrika, Vol. I. pp. 11-20.
† Ibid., pp. 125-128.
‡ Ibid., pp. 316-319.


Take the case of sepals. In the Guernsey series we found only three individuals
with less than three sepals. On examination under the microscope
in one of these cases an abortive sepal was found, in another the sepal had 
clearly once been attached, but in the third case the head was unfortunately lost 
before microscopic examination. We think that we may safely affirm that no true 
case of less than three sepals was found in the Guernsey plants. In the Dorsetshire
gathering no cases of less than three sepals occurred. In the Surrey
gathering there were in the material six cases of less than three sepals, and 
in five out of these six cases the rule of gathering buds only had been disregarded 
by a young collector* ; and the sixth was an aborted flower in which the stamens 
were not properly developed and there were no pistils at all! In the Bordighera 
celandines there are only two cases of less than three sepals. These have not
been excluded from the calculations, because we had not when counting them 
learnt from the Dorset and Guernsey series to regard all cases of less than three 
sepals with grave suspicion and examine such cases under the microscope. But if 
in 624 cases only two such individuals occurred, it seems extremely probable that 
even these were cases of sepals knocked off or aborted. In a total of 2149 heads,
there were 11 cases of less than three sepals: two of these were aborted flowers,
five of them were old flowers with parts loose, one had once had a sepal which had
been lost, and three were not closely examined. The experience we have had
leads us to believe that each flower ought to be gathered as a bud and at once 
wrapped up. It seems to us that all we can admit is a possibility of three heads 
with less than three sepals in 2149 cases, while Dr Ludwig’s 3000 from Greiz 
show no less than 60 definite cases†. We feel fairly confident that had the
Greiz flowers been collected as buds and possibly gathered and counted by adults 
instead of school children they would not have differed so widely from our material 
in this respect. As far as our experience reaches, we question the existence 
of any normal flower with less than three sepals. This view may be modified 
when further material from central Europe gathered in bud and, if necessary, 
microscopically examined, is available. We hope next season that this problem of 
the sepal may be directly investigated. 
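
The comparison with the Greiz material is easier to judge when every count is put on the same footing of cases per 1000 heads. The sketch below uses only figures stated in the text (the sizes of the four series, the cases of less than three sepals in each, and the 60 cases in 3000 from Greiz); the names are chosen here for illustration.

```python
# Heads counted in each of the four new series.
SERIES_SIZES = {"Bordighera": 624, "Guernsey": 520, "Surrey": 500, "Dorset": 505}
# Heads recorded with less than three sepals, before any were explained away.
SUSPECT_CASES = {"Bordighera": 2, "Guernsey": 3, "Surrey": 6, "Dorset": 0}

def per_thousand(cases, heads):
    """Cases expressed per 1000 heads."""
    return 1000 * cases / heads

total_heads = sum(SERIES_SIZES.values())
total_cases = sum(SUSPECT_CASES.values())

observed = per_thousand(total_cases, total_heads)
# Only the three heads not closely examined remain as possible true cases.
upper_bound = per_thousand(3, total_heads)
greiz = per_thousand(60, 3000)

print(f"recorded: {observed:.1f} per 1000; possible: {upper_bound:.1f} per 1000; "
      f"Greiz: {greiz:.1f} per 1000")
```

Even the unsifted count is about a quarter of the Greiz figure, and the cases that remain possible are less than a tenth of it.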


(2) Material of the present investigation. 


(A) 624 heads gathered in a vineyard at Bordighera, Italy, between March 4th
and 7th. Due to Mr Francis Galton. The sepals, petals, stamens and pistils
were counted by Mr N. Blanchard. The tables were prepared and the statistical
constants calculated by Miss Alice Lee. These belonged to a variety of Lesser
Celandine classed by the French botanists as Ficaria calthaefolia and considered
by them as distinct from F. ranunculoides. Table I., however, exhibits less
differentiation in the floral parts of F. calthaefolia and F. ranunculoides than can
occur between two local races of the latter. The Index Kewensis gives the former
name as a synonym of the latter, but there appears to be a more sensible
differentiation of the leaf.

* The calculator’s notes are: No. 51, “only one sepal and this hanging loose”; No. 52, “old flower
with petals detached”; No. 53, “Pistils loose”; No. 64, “very old flower, all parts loose”; No. 69,
“very old flower.” No remarks as to age of this kind occur in other cases.

† See Biometrika, Vol. I. pp. 13-15.

(B) 520 heads gathered on roadside banks at St Peter Port, Guernsey, 
between April 25th and 30th. Due to Miss Caroline Herford. The characters 
were counted, the tables formed and the statistical constants determined by 
Miss Mary Beeton. 

(C) 500 heads gathered in two lanes and a meadow near Thursley, on the 
north side of Hind Head, Surrey. The material was gathered between 12th and 
17th of April by a number of collectors under the supervision of K. Pearson. 
The characters were counted by Mr N. Blanchard and the tables and statistical 
constants are due to Dr Lee. 

(D) 505 heads gathered in a lane at Studland, Dorsetshire, on April 7 by 
Mr N. L. Blanchard. The counting is due to Miss Edna Lea-Smith and to 
Mr Blanchard. The statistical tables and constants are again due to Dr Lee.

In reducing the material Sheppard’s corrections for the moments were not 
used, partly because we are dealing with variation by units, and partly because 
it is clear in the case of the sepals, and probably true for the petals, that the 
frequency distribution has not high contact at the low end of the range. Other- 
wise the calculation of means, standard deviations, coefficients of correlation, and 
of variation proceeded in the usual manner. 

(3) Table I. gives a summary of all the statistical constants* for the four new
series and places alongside them those already found for other localities. Now
this table shows at once the remarkable differences which period in season and
environment can produce in mean, variability and correlation. Not one of these
quantities has the least approximation to constancy for all local races of Ficaria,
nor for the same race at two parts of the same season. The early Belgian
celandines, judging from the constants for stamens and pistils, which are all we
have, are practically identical with those from Italy, while the later ones diverge
widely and perhaps may be considered nearer to the Swiss Trogen series than to
any other. A brief study of this table will at once convince the reader of two
fundamental points:

(a) Local races in plants cannot be defined or distinguished by the existence
of differences many times the value of their probable errors between their means,
variations or correlations.

(b) The influences of environment and season are for plants of supreme impor- 
tance and very widely or indeed entirely screen any differences due to local race. 

* These are tabulated to four places of decimals, not because such are exact or necessary for present
purposes, but because for future investigations when we come to consider the evolutionary history of local
races, it will be needful to have the constants to this number of places in order to calculate the partial
regression coefficients true for each series to one or two places of decimals.

These results form a further proof, were any needful, that race cannot be
defined by correlation*. Plants are, of course, more subject to environment than 
animal life, and the means and variabilities change with change of environment 
and of season very readily. Man, perhaps, of all animal life seems least affected 
by environment, but a table like the above showing how different the mean, varia- 
bility and correlation are when there is slight change in locality or season, ought 
to render us cautious of cataloguing even local races of man by the means alone 
of small craniological series. Nurture and environment during growth may 
sensibly influence even human mean characters. The biometrician, far from being 
discouraged by the fact that the simpler biometric quantities—means, variabilities, 
correlations—are not constant for local races, ought rather to recognise that if 
natural selection be a reality these are the very quantities which he would expect 
to change from one local race to another, and from one environment to a second. 
Indeed it is exactly these differences which form the foundation upon which we 
hope to build up in the future the evolutionary history of local races; they are 
the material from which we must determine not only the characters which in 
each case have been selected, but also the absolute constants which define the 
species itself. 





(4) Comparing the first four columns—the new data—with the last four 
columns—the old data for central Europe—one is at once struck with the high 
degree of variability in sepals and petals of the latter as compared with the 
former. The sepals are more than four times, the petals are nearly twice, as
variable in the second group, i.e. the mean values are .6544 and 1.0946 as against
.1598 and .5944 respectively. Now we see in the case of the Dorset group that
the petal variability can rise to double the value it had in Bordighera, Guernsey 
or Surrey. But an examination of the tables below shows that much of the 
difference arises from the absence of individuals with less than 3 sepals or 7 petals 
in the first group, and as our data are drawn from very diverse districts we must 
await with interest the result of bud countings from Germany. 


TABLE II.
Frequency of Sepals per 1000.

Number of Sepals      0      1      2       3       4       5      6     7
Bordighera            -      -     3.2    932.7    62.5     1.6    -     -
St Peter Port         -      -      -     990.4     9.6      -     -     -
Surrey                -      -      -     990.0    10.0      -     -     -
Dorset                -      -      -     972.3    25.7     2.0    -     -
[label illegible]     ?     1.8    9.0    546.8   251.7   174.2  11.5   0.8
[label illegible]    3.0    0.0   21.0    733.0   152.0    86.0   5.0    -
Trogen                -      -      -     680.7   259.7    52.6   7.0    -
Gais                  -      -     2.0     91.0   283.0   616.0   7.0   1.0





* Phil. Trans. Vol. 187, A, pp. 266, 280.
† “Mathematical Contributions to the Theory of Evolution. XI. On the Influence of Natural
Selection on Variation and Correlation.” Phil. Trans. Vol. 200, A, pp. 18-21.
TABLE III.
Frequency of Petals per 1000.

Number of Petals     2    3    4     5      6      7       8       9      10     11     12    13    14    15
Bordighera           -    -    -     -     1.6   14.4   868.6    96.2   17.6    1.6    -     -     -     -
St Peter Port        -    -    -     -      -    13.5   869.2    96.1   13.5    7.7    -     -     -     -
Surrey               -    -    -     -     4.0   40.0   858.0    84.0   10.0    4.0    -     -     -     -
Dorset               ?    ?    ?     ?      ?      ?    560.4   231.7  124.7   63.4   7.9   4.0    -     -
[label illegible]    ?    ?    ?    6.8   32.7  148.0   473.0   180.0   99.0   35.3  17.2   6.0   0.5   0.5
[label illegible]    ?    ?    ?   16.0   31.0  112.0   563.0   175.0   62.0   24.0  11.0   1.0   0.0   1.0
Trogen               -    -    -    3.5   14.0  119.3   642.1   150.9   59.7   10.5    -     -     -     -
Gais                 -    -    -     -     2.0   34.0   139.0   265.0  296.0  175.0  69.0  18.0   2.0    -



But making all allowance for this the German and Swiss series show a marked
increase in the number of both sepals and petals over our series, culminating
in Gais with its modes of 5 sepals and 10 petals. Our Dorsetshire series indicates
a substantial advance in the same direction as far as the petals are concerned, the
Bordighera tends somewhat in the same direction for the petals. It remains as a
task for next spring to determine whether the Guernsey and Gais series cannot be
linked together by a continuous system of intermediate series, if only the flowers
be gathered at a sufficiently wide range of places and at different seasons in those
places.
(5) Turning to the correlations which involve sepals one is hardly surprised
at their irregularity, for their values depend in our new data on the distribution of
a very few individuals with more than four sepals. These for Bordighera are
less than 7 per cent., for Dorset less than 3 per cent., and for Guernsey and Surrey
not more than 1 per cent.! Hence one or two irregular or anomalous individuals
cause the correlation to alter in a very remarkable manner. A much safer set
of correlations are those between petals, stamens and pistils. Unfortunately the
counting of stamens and pistils is a much harder task, and for a series of 500
bud-gatherings requires a week or two of very laborious work.


Looking at our new data, which are arranged according to number of stamens,
we see several important and almost uniform results:

(a) If the different series be arranged in ascending order of stamens, this will
also be an ascending order for the number of pistils. With one exception, Surrey,
this is also an ascending order for the number of petals.

(b) The ascending order for stamens or pistils is also an ascending order
for the variability of both stamens and pistils. With the same exception, Surrey,
it is an ascending order for variability of petals.

(c) The ascending order for means or variabilities of stamens or pistils is an
ascending order for the three correlations between stamens, pistils and petals.
The slight exception to this rule, the Surrey correlation of petals and stamens, is
well within the probable error of the determinations.

Summing up these results in a single statement, we conclude from the data 
that were gathered and counted on one uniform plan that: 


The local races of the lesser celandine which have flowers with more numerous 
parts, have those parts more variable and more highly correlated. 


Let us compare these results with the data obtained by other observers. 
Germany contributes nothing, for stamens and pistils have not yet been counted. 
The Swiss data, if individually a little anomalous, form, if clubbed together, a 
striking confirmation of our argument,—they have fewer pistils and stamens than 
our four series, the variations are less, and the correlations for stamens and pistils 
are low—Gais has the lowest result of all, and Trogen only about the Guernsey 
value. Unfortunately we can only compare one correlation and this has been 
worked out for only small numbers, 285 in one case and 80 in the other. With 
regard to Belgium the comparative results are not so satisfactory. The early 
Belgium flowers have about the same number of stamens as the Italian and fewer 
pistils, the variability of both stamens and pistils is less. The late Belgium 
flowers have the least number of stamens and pistils of any series, their variability 
in both stamens and pistils is less than all four of our series, and less than the 
mean of the two Swiss series. Here again we have unfortunately only one corre- 
lation available for comparison, that of stamens and pistils. It is for the early 
flowers somewhat greater than that for the Italian series, when it ought to be 
about equal to it, while for the late flowers, it is the highest of all when it should 
be the least from the above rule. But just in this material a new factor comes 
into play, which, if not the source of the result, deserves to be taken into account. 
The late flowers are said to be from the same plants as the early flowers. But 
pistils and stamens cannot be counted unless the flower be gathered. Hence the 
plants from which the later flowers are gathered may be heterogeneous in character, 
namely plants naturally flowering late, and plants which have been more or less 
injured by one or more flowers being removed and which have reflowered in 
a more or less exhausted condition. Such heterogeneity would tend to increase 
correlation. Clearly in this as in so many cases the conditions are very complex, 
and we cannot without further investigation club together flowers from plants 
(a) which have had earlier flowers gathered, (b) which naturally flower late, with 
the late flowers of plants which have been flowering throughout the season 
undisturbed, and term the group ‘late flowers’ without danger of introducing 
heterogeneity. Till therefore the whole subject of the correlation of late and 
early flowers has been gone into carefully with ample data, we may take, we hold, 
the above general principle—that increase in the number of parts denotes increase 
in the variability and correlation of those parts—as a working hypothesis to 
be demonstrated or modified by further researches. We, of course, do not 
extend the principle beyond the species for which the data are deduced, even as a 
“working hypothesis,” and we state it only for petals, stamens and pistils.














152 Cooperative Investigations on Plants 


(6) It may be useful to examine the interracial result from the correlation 
standpoint, although at present we have rather meagre data. We have eight local 
series ; considering them of equal weight, although the Belgian and Swiss series 
are very inferior in number to ours, we find : 


Mean of local means for stamens                      = 26.498 ± 1.321
Mean of local means for pistils                      = 19.432 ± 1.265
Variability of local means for stamens               = 5.5409 ± .9343
Variability of local means for pistils               = 5.3037 ± .8943
Correlation of local means for stamens and pistils   = .9524 ± .0235.
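The memoir does not say how these probable errors were computed. As a hedged check, the sketch below assumes the usual large-sample expressions, 0.67449 σ/√n for a mean and 0.67449 σ/√(2n) for a standard deviation, with n = 8 local series; the function names are illustrative only. The probable error of the correlation is not recomputed here.

```python
import math

# Assumed factor converting a standard error into a probable error
# (the quartile deviation of the normal curve in units of sigma).
PROBABLE_ERROR_FACTOR = 0.67449


def probable_error_of_mean(sigma, n):
    """Probable error of a mean of n values with standard deviation sigma."""
    return PROBABLE_ERROR_FACTOR * sigma / math.sqrt(n)


def probable_error_of_sd(sigma, n):
    """Probable error of a standard deviation estimated from n values."""
    return PROBABLE_ERROR_FACTOR * sigma / math.sqrt(2 * n)


N_SERIES = 8  # eight local series, taken as of equal weight
for organ, sigma in (("stamens", 5.5409), ("pistils", 5.3037)):
    print(organ,
          round(probable_error_of_mean(sigma, N_SERIES), 3),
          round(probable_error_of_sd(sigma, N_SERIES), 4))
```

Under these assumptions the values printed for the means and variabilities agree with those quoted above to the last figure given.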


The probable errors are very high because we have only eight pairs. It is therefore
difficult, perhaps, to make a definite statement that the interracial is greater
than the average intraracial variability, but it appears like it; even the relative
order of variability of pistils and stamens may be changed when we pass from
intraracial to interracial constants. On the other hand the correlation is so high*
that we can make a very definite statement about it, i.e.:

The interracial correlation of the mean numbers of stamens and pistils is very
much greater than the mean intraracial correlation between stamens and pistils,
being to the latter nearly in the ratio of 12 to 7.

[Diagram: Lesser Celandine. Interracial Regression Lines. Stamens and Pistils. Axis label: Number of Stamens in Racial Mean.]

* And therefore its probable error, even with only eight pairs, is so small.
These conclusions show us how extremely cautious we must be in extending
conclusions deduced by intraracial calculations to interracial relations. Intraracial
and interracial variabilities and correlations may be of a totally different
order. By the usual theory of regression we have the following formulae to find
the probable number of stamens in a race given the mean number of pistils, and
the probable number of pistils given the mean number of stamens:


Probable mean number of stamens in a local race
    = .9939 × (observed mean number of pistils) + 7.185.

Probable mean number of pistils in a local race
    = .9106 × (observed mean number of stamens) − 4.697.


Comparing with actual values we have: 


                    Stamens from Pistils      Pistils from Stamens
                    Observed   Calculated     Observed   Calculated
Bordighera            26.7        24.9          17.7        19.6
St Peter Port         28.0        31.0          23.9        20.8
Surrey                32.9        31.7          24.5        25.3
Dorset                35.6        35.3          28.3        27.7
Early Belgian         26.7        24.5          17.4        19.6
Late Belgian          17.9        19.3          12.1        11.6
Trogen                20.4        20.4          13.3        13.8
Gais                  23.8        25.2          18.1        17.0


The maximum error in the number of stamens is 3.0 and the mean error 1.3;
for the number of pistils it is 3.1 and the mean error again 1.3. Thus our
formulae will give the number of stamens from pistils, or pistils from stamens
in any race with an average error of about one pistil or stamen, and a maximum
of about three. They are thus close enough for most practical purposes. The
diagram gives the lines to read stamens from pistils and pistils from stamens.
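The comparison of observed and calculated values can be retraced with a short sketch. It applies the two regression formulae to the observed racial means as printed in the table; the function and variable names are illustrative only. Because the observed means are given to one decimal place, recomputed values may differ from the printed ones in the last figure.

```python
# Regression coefficients and constants as quoted in the text.
def stamens_from_pistils(mean_pistils):
    return 0.9939 * mean_pistils + 7.185


def pistils_from_stamens(mean_stamens):
    return 0.9106 * mean_stamens - 4.697


# Observed racial means (stamens, pistils), as printed in the table.
OBSERVED = {
    "Bordighera": (26.7, 17.7),
    "St Peter Port": (28.0, 23.9),
    "Surrey": (32.9, 24.5),
    "Dorset": (35.6, 28.3),
    "Early Belgian": (26.7, 17.4),
    "Late Belgian": (17.9, 12.1),
    "Trogen": (20.4, 13.3),
    "Gais": (23.8, 18.1),
}

# Absolute errors of each prediction against the observed mean.
stamen_errors = [abs(s - stamens_from_pistils(p)) for s, p in OBSERVED.values()]
pistil_errors = [abs(p - pistils_from_stamens(s)) for s, p in OBSERVED.values()]

for name, errors in (("stamens", stamen_errors), ("pistils", pistil_errors)):
    print(name,
          "max error", round(max(errors), 1),
          "mean error", round(sum(errors) / len(errors), 1))
```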


(7) There are a number of other points which are suggested by our table, 
but it seems better to defer their discussion until more ample data are forth- 
coming. There is hardly any district from which at present a series of 500—600 
celandine heads would not be of value. But these series ought to be gathered on 
a uniform plan, i.e. gathered as buds, each bud being wrapped in a separate soft
paper cover. Further since the labour of counting is very great, those who will 
count as well as gather are the more valuable helpers. In counting great care 
must be taken to separate the individual pistils, and all heads with less than 
three sepals ought to be carefully examined with a microscope or powerful lens. 
If possible two series should be taken from each locality, one early and one late in 
the season, but the flowers should not be taken off the same plants, but off plants 
having substantially the same environment. A record of date, environment and
locality should accompany each series. The most important localities at present
seem to be, Mid-France, South and Mid-Italy, Mid and North England, Switzer- 
land and South Germany in as many districts as possible, and any northern 
European stations, where workers may be available. It would be of great value 
to have one species of plant thoroughly worked out from the biometric standpoint 
over a wide area. This can only be done by cooperative labour. 


CORRELATION TABLES. LESSER CELANDINE. 


A. SEPALS AND PETALS. 


























I. Bordighera.

                     Sepals
Petals       2      3      4      5   Totals
   6         -      1      -      -       1
   7         -      3      6      -       9
   8         2    521     19      -     542
   9         -     46     14      -      60
  10         -     10      -      1      11
  11         -      1      -      -       1
Totals       2    582     39      1     624


II. Guernsey.

                 Sepals
Petals       3      4   Totals
   7         6      1       7
   8       450      2     452
   9        49      1      50
  10         7      -       7
  11         3      1       4
Totals     515      5     520


III. Surrey.

                 Sepals
Petals       3      4   Totals
   6         2      -       2
   7        20      -      20
   8       429      -     429
   9        38      4      42
  10         5      -       5
  11         1      1       2
Totals     495      5     500


IV. Dorset.

                     Sepals
Petals       3      4      5   Totals
   7         4      -      -       4
   8       279      4      -     283
   9       115      2      -     117
  10        59      4      -      63
  11        31      1      -      32
  12         ?      ?      ?       4
  13         ?      ?      ?       2
Totals     491     13      1     505

(In rows 12 and 13 of the Dorset table only the row totals are legible; by the
column totals those two rows together contain 3, 2 and 1 flowers with 3, 4 and
5 sepals respectively.)
SECOND REPORT ON THE RESULT OF CROSSING
JAPANESE WALTZING MICE WITH EUROPEAN 
ALBINO RACES. 


By A. D. DARBISHIRE, Balliol College, Oxford. 


THE present paper is an account of the continuation of work, the first part of 
which was published in the last number of this Journal. It contains, first, an 
additional record of the results of crossing Japanese waltzing mice with albino 
mice, embodying the previously recorded nine families and adding eleven new 
ones; and secondly the result of pairing hybrids resulting from such crosses, and 
of crossing these hybrids with albinos. 


Crosses between Japanese Waltzing and Albino Mice. 


The number of young obtained from such crosses has increased since the last
paper from 48 to 154; but the uniformity of the result (the almost universal
presence of patches of colour like that of the house-mouse) has not been maintained:
there have appeared besides more yellow mice, black mice, and black and
white.

The classification of colours in the hybrid has therefore had to be re-organized. 
The hybrids nearly always shew a considerable amount of white: they are now 
classified according to the amount of whiteness into five groups: each group 
is then subdivided into five classes according to the colour itself. 

Group 1 (Fig. 1) has more white and less extent of coloured patches than the 
normal waltzing mouse. The distribution of colour on a waltzer is shewn in 
Fig. 6. 

Group 2 (Fig. 2) has about as much white as a normal waltzing mouse. 

Group 3 (Fig. 3) has less white (i.e. greater extent of coloured patches) than a 
waltzer. 

Group 4 (Fig. 4) has still less white and leads on to 

Group 5 (Fig. 5) which has no pure white: but the belly is whitish not gray. 

Group 6 contains mice whose bellies are nearly the same colour as their
backs. A gray mouse of group 6 (6c or 6d) is therefore indistinguishable from a
house-mouse.

The individuals of each group are then classified according to the colour of the
coloured patches.





Class a = yellow.
Class b = fawn yellow.
Class c = pale wild colour (i.e. that of the house-mouse).
Class d = dark wild colour (i.e. that of the house-mouse).
Class e = black,
and Class f = “lilac” = pale blue gray; at present only exhibited by the offspring
of hybrids.
It is not suggested that these form one colour series. Table I. shews the 
distribution of colour and the colour itself in all the individuals of twenty families 
produced by crossing waltzing with albino mice. All the mice in the Table have 
black eyes. 
TABLE I.
| | i ee | 
| Number| Group L Group II. Group III. Group IV. Group V. Group VI. 
of ; 2S Saree : af.) Totals 
Cross lalolceldje|a\|bleld | ela|lb\e|diela b\el|d\elalb eld ela bleldle 
> i EE eam geal as (ey ne gee Waeia (ee R Pe Gee ae | ae -|-|-j- my ee (= -|-}- a 
ii |- -i-|}-}-|-|1]-|-|-|-|-|-|-]|- -|}-|-|- -]-|- | - 1 | 
vii -}-}-| 2/1] -|1l)/1]-j- -|- -|-|2})-|-]--|-|-|-| 7] 
vii |-|-|-|-|-| - | Ee ae ee -|-1|-|-]-|-|-|6]-| -|-|-I- 6 | 
ix |- = Seat ee ee a = ey Sa fees (a ee (ay Ee - 6 | 
Xii |- = -|— - - - = | _ - - _ - = = = = = 2 _ - 3 = = - 5 } 
xiii |-|-|-|-|-| -|-|-|-|-]-|-|31-|-|-|- “f= |= 1-8] 5) =| 4 a 
j | xvi |-|-|-|-|-|-|-|-|-]-]- 2 -|- Lj-|- a -j-|-|-|-| 3 
xx {-|-|-|-|-| -|-/1]| 2] -|-j|- -|- - - 2|-|- -| 6 
xxvii® |-|-|-|-|-| -| -|-|-]1]-|-|-|]-|-|-|- Sa ft j= pia pei ae 
xxxiii# |-|-|-|-|-| -|- -|-]-]-!/-|-]-|]-}-]-!-]-]-)-]|-\1]-|- -|-|-} 1] 
xxxv [-[-/-j-i-|-|-/1)1 = Z\|-|- -|2|-|- | - 6 
A oh Sad taal bes Sak ed ek 2 eet l- -|-|-)-]- =) es li be 4 | 
lix® | : =f ==, [ei - -{lj-|-]|- cat (cael ‘oon | - 1 2 | 
Ixi |-|- t—}=1a}-)-}-1=fa]- - 1|-|- 1 | - - 7 
Ixii | -J-|-}-|1]/-|-l-|-]8|-|-|- a1 -|-|3|-j|-|- 6 | 
Ixxiii_ | af bead Cael cicada hat cae ici a 19) (eae ea We) = = : 4 6 
Ixxiv {|-|-|-|-|-| -|- -j|/-|-j-|1 -i- 6 me ss ie i lon ‘ 
Ixxv |- -/1 2/-|-|-|-|4a}-j]-|- 1|-|-|- | 5 
lxxvil -|-|-|- -|- - - | -|;-j|J-/];-[- -|/2)- l = 4 1 a bs 8 
Ixxviii | - -j-|-|} -|}-}-|-/J1]-;-]-|-|1]- -}-|]-|-|-|1)1]- the fg 4 
Ixxxii* |-|-|-|-|-|-|-|-|-|-|-j|-|1 -|- -|-|-j|- 1] - -|-|- 2 
Ixxxiv j|-)-|-|-|-| -|-|-j-]| -~-|-|-|-j-|/-|-|]-|-|-|5]-| 4]-]-]-/-I-]-]- 9 | 
Ixxxv - -|-| - - - aa | — - - - 2 - - - 1 ~ -—ji-|=- 3 1 -|- -7 71 
Ixxxvi |-|-/|-|-|}-| -!-!-|/-]J-]-|;-]2/-]-j- —-}-{|-|-| 5]-]- {-j-|-]-]- 7 
xeiv® |-|-|-|-|-| - “i teal tod |-|-|- -|-|-|-|-|-]-]-|-:-]-l-]-| 2 
e |-\-|-|-|-|-|-|1)-|-|-)-|38)-]-|-|-|-|-|-|-/-|5]-|-|--|-|-|-| 9] 
ai j-|-|-|-|-|-|-/1 -|- -|-|2)-|-)-]-|1]-|- wile? Gad ed saa fed 9 | 
ei |-|-|-|-|-|-|-|1)-]-|-|-|3 at tail tos he es es tes sot allot ‘a 8 | 
’ |— i) ——- r— | 7; 
Totals |-|-|3|-|-| -|- a1] 7|2)-~-|-la7|/5|3)|-|- 12) 3) 1) 7 )-|49| 10) 3 -|-|-|1/-] 154 





* In these crosses the parents were ♀ waltzing mouse × ♂ albino; in all the rest the parents were ♀ albino × ♂ waltzer.
From the following re-classification of the above Table I. the frequency of
degrees of whiteness among 154 hybrids will be at once evident.

Group 1 ............   3
Group 2 ............  30
Group 3 ............  35
Group 4 ............  16
Group 5 ............  69
Group 6 ............   1
                     154


That is to say the majority of hybrids have coloured patches in, roughly, the 
same place as the waltzers have; or are wild-coloured mice except for the belly. 


The following arrangement shews the frequency of colours among 154 hybrids. 





Class a ............   7
Class b ............   0
Class c ............ 112
Class d ............  26
Class e ............   9
                     154


That is to say by far the greater number of hybrids exhibit wild colour. 
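The two re-classifications are simply the marginal sums of the "Totals" row of Table I. The sketch below makes that explicit; the cell counts are a reading of that row (an assumption where the print is unclear), and the variable names are illustrative only.

```python
from collections import Counter

# (group, colour class) -> number of hybrids, read from the Totals row of Table I.
TOTALS_ROW = {
    (1, "c"): 3,
    (2, "c"): 21, (2, "d"): 7, (2, "e"): 2,
    (3, "c"): 27, (3, "d"): 5, (3, "e"): 3,
    (4, "c"): 12, (4, "d"): 3, (4, "e"): 1,
    (5, "a"): 7, (5, "c"): 49, (5, "d"): 10, (5, "e"): 3,
    (6, "d"): 1,
}

by_group = Counter()   # degrees of whiteness
by_class = Counter()   # nature of colour
for (group, colour), count in TOTALS_ROW.items():
    by_group[group] += count
    by_class[colour] += count

print(sorted(by_group.items()))
print(sorted(by_class.items()))
print("total", sum(TOTALS_ROW.values()))
```

Summed by group, the cells give the frequencies of whiteness; summed by class, they give the frequencies of colour, with classes c and d together making up the wild-coloured majority.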


Offspring of Hybrids. 
Hybrids have been paired inter se and crossed with albinos. 


A. Offspring of pairs of Hybrids. 





The number of mice which waltz in this generation is eight. At present nine 
albinos have appeared: the actual arrangement of the colours and the colours 
themselves are here shewn: the letter w indicates that the mouse exhibits waltzing 
movements, and p that it has pink eyes. It is unnecessary to say that albinos 
have pink eyes. 

H,. % 2dx df Ze. 
3 Young. 2f(wp), 2a (p), 2b. 
H,. 2lex J 2%. 
4 Young. 3b(p), 3c, 4a(p), 4a (p). 
H,. 2 2dx Jf 3c. 
5 Young. 3a, 3b, 3b, 3e(w), 3f. 
Hy 2 2x J %X. 
7 Young. 4 albinos, 3b(wp), 3b(p), 3c. 
Hy 2 2xdJ 3c.

7 Young. 2 albinos, 2c, 2c, 2c, 2c(w), 4e (w). 
x, 2 texd 2c. 

6 Young. Albino. Albino(w), 2c (w), 3c(w), 2e, 2f 
Hy 2 2x Jf 3c. 

5 Young. Albino, 2e, 2c, 4e, 4d. 


That is to say there are 37 young from 14 hybrid parents of degrees of 
whiteness included in Groups 2 and 3. The young may be classified according to 
degrees of whiteness and according to their colour. 


Degrees of Whiteness          Nature of Colour
Albinos ........   9          Albinos ........   9
Group 1 ........   0          Class a ........   4
Group 2 ........  12          Class b ........   6
Group 3 ........  11          Class c ........   9
Group 4 ........   5          Class d ........   1
                  37          Class e ........   5
                              Class f ........   3
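The classification above can be retraced from the seven litter records. In the sketch below each young mouse is entered by its printed code (group number followed by class letter, or "albino"), with "w" for waltzing and "p" for pink eyes; this encoding and the names used are illustrative assumptions, not the author's notation.

```python
from collections import Counter

# Young of the seven hybrid pairs, in the order of the records above.
LITTERS = [
    ["2f wp", "2a p", "2b"],
    ["3b p", "3c", "4a p", "4a p"],
    ["3a", "3b", "3b", "3e w", "3f"],
    ["albino"] * 4 + ["3b wp", "3b p", "3c"],
    ["albino"] * 2 + ["2c", "2c", "2c", "2c w", "4e w"],
    ["albino", "albino w", "2c w", "3c w", "2e", "2f"],
    ["albino", "2e", "2c", "4e", "4d"],
]


def tally(litters):
    """Count young by degree of whiteness, by colour class, and by waltzing."""
    whiteness, colour, waltzers = Counter(), Counter(), 0
    for litter in litters:
        for young in litter:
            code, _, flags = young.partition(" ")
            if "w" in flags:
                waltzers += 1
            if code == "albino":
                whiteness["albino"] += 1
                colour["albino"] += 1
            else:
                whiteness["group " + code[0]] += 1
                colour["class " + code[1]] += 1
    return whiteness, colour, waltzers


whiteness, colour, waltzers = tally(LITTERS)
print(dict(whiteness), dict(colour), waltzers)
```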


B. Offspring of Hybrids and Albinos. 
The following is a record of young produced by crossing a hybrid with an
albino.
Hy,  albinox 3c. 
4 Young. 5c, 5e, 5e, 5e. 
H,.  albinox 3c. 
9 Young. 5 albinos and 4e, 4d, 4d, 5d. 
H,. 2 albino x # 5c. 
5 Young. 2 albinos and 5c, 5d, 6e. 
H;. albino x ¥ 5c. 
5 Young. 1 albino and 6d, 6d, 6d, 6c. 
H,. Qalbinox ¥ 5c. 
4 Young. 2 albinos and 6d, 6d. 
Hy. albino x £ 3d. 
6 Young. 5 albinos and 6e. 


Hy. 2 2cx ¥ albino. 
5 Young. 2 albinos and 3c, 6e, 6e. 
H,;. 3 5ex albino. 


3 Young. 1 albino and 5d, 6e. 


Hy. % 5cx f albino. 
6 Young. 4 albinos and 2c, 3e. 
H,;. § albino x ¥ 5e. 
8 Young. 4 albinos, 3a, 3a, 6a, 3e. 
Hy. @albinox 5c. 
5 Young. 2 albinos, 3e, 3d, 6d. 
Hy. albino x ¥ 5e. 
6 Young. 3 albinos, 4d, 4d, 6d. 
Hy. § albino x ¥ 3c. 
4 Young. 1 albino, 6e, 5d, 5d. 
Hy. 2 5cex £ albino. 
2 Young. 5d, 5c. 
Hy. 3 5dx f albino. 
1 Young. 1 albino. 
Hy. $$ 5ex f albino. 
6 Young. 2 albinos and 6c, 6c, 4c, 6e. 
Hy. § albino x £ 5e. 
9 Young. 4 albinos and 6e, 6e, 6e, 6e, 4c. 
That is to say there is a fairly sharp segregation into albino and wild-coloured 
mice. Out of 88 young there are 39 albino, 31 wild-coloured, 15 black and 
3 yellow mice. 


Discussion of Results. 
1. The First Generation. 


As Table I. shews, the first generation has not a uniform colour: any modi- 
fication of Mendel’s hypothesis involves the uniformity of the first generation. 
For it is one of the Mendelian principles that segregation of characters does not 
occur until the generation produced by pairing individuals of the first (hybrid) 
generation. The generation, thus produced, essentially consists of 25% organisms
with the recessive character and 75% with the dominant; and this not only in
the case of a simple character, but in that of a complex one; for example the 
hybrids produced by Mendel’s crossing a white flowered with a purple flowered 
(a complex colour according to Mendel) bean were all purple. But the offspring 
produced by the pairing of these purple hybrids exhibited the most marked 
heterogeneity. The essential point is that segregation of however complex 
characters never occurs before the second generation*. 

Now in these mice this segregation seems to take place in the first generation,
for there appear in it besides the wild colour, yellow and black, as can be seen
from Table I.


* G. Mendel: “Versuche mit Pflanzen-Hybriden,” Verhandl. des Naturforsch. Ver. Brünn, IV. Band.
1865 (Abhandlungen), pp. 34, 35.
The view that albinism is “recessive” in Mendel’s sense implies that albinos
of any ancestry, provided they themselves are really albino, will behave in the 
same way when crossed. Now, the young of all the albinos are not the same and 
at the end of my last paper it was pointed out that a difference in the kind of the 
litter corresponded to a difference in the ancestry of the albino mother: that is to 
say, that wild-coloured mice only appeared in the litters of pure-bred albinos: 
and the generalization that the more in-bred a mouse was, the less power it 
had of transmitting its whiteness, was suggested. The evidence brought forward
for this was the results of von Guaita, who in four crosses between waltzing mice
and albinos which were in-bred for 29 generations always got absolutely wild-coloured
mice; and my own results consisting of nine such crosses. This
generalization is borne out by the additional eleven litters which have appeared
since my last paper was written; as a classification of litters into (a) those
from cross-bred albinos and (b) those from pure-bred albinos will readily shew
(Table II.).








TABLE II.
Group 1 | Group 2 | Group 3| Group4 Group 5 | Group 6 
| eit e | | | 
| (a) Young from cross-bred albinos | 2 11 S|} 4 qi = 
| (6) Young from pure-bred albinos | — 13 2 | — 60* | 1 





In my previous paper I think I did not sufficiently emphasize the difference 
which seems to me to exist between in-bred and pure-bred mice: the above table 
is certainly of pure-bred mice; but what is not certain is whether the relative 
inability to transmit whiteness among pure-bred mice is due to the fact of their 
being pure-bred, or in-bred. Pure-bred mice usually are in-bred and von Guaita’s 
unquestionably were; but this does not help us to decide whether the relative 
inability to transmit whiteness is due to in-breeding or pure-breeding; it only 
shews that it is not impossible that it may be due to in-breeding. The meanings 
of these terms should be clearly understood ; there are two pairs of categories, one 
pair in which cross-bred is antithetic to pure-bred, and another pair in which 
out-bred is antithetic to in-bred, the former pair referring to the presence or
absence of any other colour than white in the ancestry of the mouse, the latter 
pair to the distance or nearness of relation of the parents, grandparents, etc. of the 
mouse under consideration. And before the cause of the great preponderance of 
the number of wild-coloured mice in the litters of pure-bred over the number 
of those in the litters of cross-bred albinos can be ascertained waltzing mice must 
be crossed with in-bred and out-bred pure-bred albinos and in-bred and out-bred 
cross-bred albinos. (Such kinds of albinos are being reared as quickly as possible.) 
Whatever the explanation be, the fact remains that the ancestry of a white mouse 
does make a difference in the character of its offspring when crossed with a 
waltzing mouse. 

* The 60 in the table includes the 7 yellow mice 5a in which there is no whiteness. 


2. The Second Generation.


It will be seen by inspection of the tables recording the offspring of hybrids, 
that when a hybrid is paired with a hybrid the offspring form a much more 
heterogeneous collection than does the offspring from a hybrid and a white in 
which the young are more sharply segregated into white and wild-coloured. And 
this difference is coincident with, if not causally connected with a difference in 
the ancestry of the two sets of young. For the heterogeneous offspring of two 
hybrids have a more complex ancestry than do the less variable offspring of 
a hybrid and an albino; and it may be a fact of similar meaning that the hybrids 
produced by crossing a waltzing mouse with a cross-bred albino are more hetero- 
geneous than those produced by a similar cross in which the albino, however, was 
pure-bred (see Table II.)—in both cases the more heterogeneous collection of 
offspring comes from parents of which the ancestry is more complex; that is 
to say the young of the cross-bred albinos are more heterogeneous than those 
of the pure-bred; and the young produced by pairing two hybrids are more 
heterogeneous than those produced by crossing hybrids with albinos. 

No theory of compound allelomorphs such as that put forward by Mendel will 
account for this striking difference between the variability of the two groups 
of offspring. 

Remembering that 6c or 6d indicates a mouse indistinguishable from the 
common Mus musculus, the curious fact will be noticed that only one such 
appeared among 154 hybrids of the first generation, whereas out of the 88 offspring 
of a cross between a hybrid and an albino there were *0. This result may be 
parallel with that obtained by Darwin* when he crossed a white fantail with 
a black barb and also a barb with a spot and then crossed the mongrel barb-fantail
with the mongrel barb-spot and got “a bird of as beautiful a blue colour, with the
white loins, double black wing-bar, and barred and white-edged tail-feathers, 
as any wild rock-pigeon!”; i.e. reversion, in both cases, did not occur until the 
third generation. 

The facts so far observed which are in possible accordance with some form 
of Mendelian hypothesis are (1) the apparently regular appearance of albinos 
when hybrids are crossed with albinos, although the evidence at present available
does not suffice to shew whether these occur in Mendelian proportions (50%
albino and 50% hybrid) or not, (2) the well-known fact that albinos of any
ancestry when paired together produce albino young, exceptions to this rule being 
at least very rare, and (3) the appearance of waltzing and albino mice in the 
second hybrid generation. 

The first of these results although not inconsistent with the truth of Mendel’s
hypothesis cannot be taken as proof that this hypothesis applies; for a similar
result is observed in such cases as that of human eye-colour where Mendel’s Laws
have been shewn not to hold†.


* The Origin of Species, pp. 17, 18.
† Karl Pearson: see below, pp. 213 et seq.
Postscript, added Feb. 12, 1903.


The mice, resulting from a first cross between albinos and pink-eyed waltzing
mice, are now 203, all the individuals having dark eyes, while none have wholly 
white fur, and none waltz. In the second generation, there are 66 mice produced 
by pairing hybrids; of these 13 are pink-eyed albinos, and 17 have pink eyes and 
coats more or less coloured. In the same generation there are 205 individuals 
produced by crossing albinos with hybrids ; of these 111 are albinos, the remainder 
having dark eyes and some colour in the coat. 


The proportions of albinos and of individuals not albino are not in disagreement
with Mendel’s results, but the exhibition of these results in Mendelian form
depends on the adoption of a quite artificial category of coat-colours; for albinism
includes only pure whiteness of coat, while coloured coats include the whole range
of conditions from white with small patches of pale yellow to dark “wild colour”
or black.
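The statement about proportions can be illustrated numerically. The sketch below is not part of the author's argument: it assumes a chi-square test of goodness of fit against the Mendelian expectations of 25 per cent. albinos among the offspring of paired hybrids and 50 per cent. among the offspring of hybrids and albinos, using the counts of the postscript. The names and the 5 per cent. criterion are illustrative choices.

```python
def chi_square(observed, expected_fractions):
    """Goodness-of-fit statistic for observed counts against expected fractions."""
    total = sum(observed)
    return sum((count - total * fraction) ** 2 / (total * fraction)
               for count, fraction in zip(observed, expected_fractions))


# Hybrid x hybrid: 66 young, 13 albinos; Mendelian expectation 1 : 3.
hybrid_x_hybrid = chi_square([13, 66 - 13], [0.25, 0.75])

# Hybrid x albino: 205 young, 111 albinos; Mendelian expectation 1 : 1.
hybrid_x_albino = chi_square([111, 205 - 111], [0.50, 0.50])

# Upper 5 per cent. point of chi-square with one degree of freedom.
CRITICAL_5_PER_CENT = 3.841

print(round(hybrid_x_hybrid, 3), round(hybrid_x_albino, 3))
print(hybrid_x_hybrid < CRITICAL_5_PER_CENT, hybrid_x_albino < CRITICAL_5_PER_CENT)
```

On this criterion neither proportion departs significantly from the Mendelian expectation, which agrees with the statement that the proportions are not in disagreement with Mendel's results.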


The inheritance of eye-colour is not in accordance with Mendel’s results. For
since pink eyes occur in particoloured mice, the possession of pink eyes must on 
Mendel’s view depend on a separate embryonic element from that which deter- 
mines coat-colour. Pink eyes are however not “dominant,” since the two pink- 
eyed parents of the first generation always produce dark-eyed young. For the 
same reason pink eyes are not “recessive.” Yet although pink eyes disappear in 
the first generation (the result of crossing two pink-eyed parents) they reappear 
in the second; but a correlation is then established between coat-colour and eye- 
colour which is strong in the offspring of hybrids paired together, and at present 
perfect in the offspring of hybrids and albinos. The behaviour of eye-colour is 
thus in every respect discordant with Mendel’s results. 








NEW TABLES OF THE PROBABILITY INTEGRAL*. 
By W. F. SHEPPARD, M.A., LL.M. 


Description of the Tables. 


1. THE “probability integral” expresses the area of the normal curve, or
curve of error, whose equation is

$$ z = \frac{1}{\sqrt{2\pi}}\, e^{-\frac{1}{2}x^{2}} \tag{1} $$

The abscissa $x$ is measured from the central ordinate, about which the curve
is symmetrical, and its unit of measurement is the standard deviation (square
root of mean square of deviation). The whole area of the curve is unity. If this
whole area be divided by the ordinate $z$, at distance $x$ from the central ordinate,
into portions $\tfrac{1}{2}(1+\alpha)$ and $\tfrac{1}{2}(1-\alpha)$, then

$$ \tfrac{1}{2}(1+\alpha) = \int_{-\infty}^{x} z\,dx, \qquad \tfrac{1}{2}(1-\alpha) = \int_{x}^{\infty} z\,dx \tag{2} $$

$$ \alpha = 2\int_{0}^{x} z\,dx \tag{2A} $$

where $z$ has the value given by (1).


Tables I. and II. give the values of $z$ and of $\tfrac{1}{2}(1+\alpha)$ for any value of $x$ from
.00 to 6.00; and therefore, the curve being symmetrical about the central ordinate,
they enable us to determine the values of $\tfrac{1}{2}(1+\alpha)$, $\tfrac{1}{2}(1-\alpha)$, and $z$, for any value
of $x$ between −6.00 and +6.00. Tables III. and IV. give the values of $x$ and of $z$
for any value of $\alpha$ from .00 to .80; and therefore for any value of $\alpha$ between
−.80 and +.80.

* [In the prospectus of Biometrika, the Editors promised to provide “numerical tables tending
to reduce the labour of statistical arithmetic.” The first instalment of such tables by Mr Palin Elderton
was given in Vol. 1. This second instalment provides for the widely felt want of probability integral
and normal curve data calculated on the basis of the standard deviation and not on that either of
the modulus or of the probable error. EDITORS.]


If $X$ denotes the measurement of an organ common to a large number $N$
of individuals, and if the different values of $X$ are distributed according to the
normal law about a mean value $m$ with mean square of deviation $\sigma^2$, then the
numbers of individuals whose measurements are respectively less and greater than
any particular value $X$ are $N_1$ and $N_2$, where

$$ N_1 = \tfrac{1}{2}N(1+\alpha), \qquad N_2 = \tfrac{1}{2}N(1-\alpha), $$

and the ordinate of the figure of frequency corresponding to the measurement
$X$ is

$$ Nz/\sigma, $$

where $\tfrac{1}{2}(1+\alpha)$ and $z$ are the values of the area and of the ordinate given by
Tables I. and II. as corresponding to

$$ x = (X - m)/\sigma. $$
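The quantities tabulated can be reproduced numerically. The sketch below is an illustration and not the method by which the Tables were computed: it takes the ordinate from (1) and obtains the area from the error function of Python's standard library. The function names, and the values of N, m and σ in the example, are hypothetical.

```python
import math


def ordinate(x):
    """Ordinate z of the normal curve at abscissa x, as in equation (1)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def area_below(x):
    """Area (1 + alpha)/2 of the normal curve lying below abscissa x."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def frequencies(total, value, mean, sigma):
    """Numbers below and above `value`, and the frequency ordinate N z / sigma."""
    x = (value - mean) / sigma
    below = total * area_below(x)
    return below, total - below, total * ordinate(x) / sigma


# Hypothetical example: 1000 individuals, mean 50, standard deviation 5, X = 52.
n_below, n_above, height = frequencies(1000, 52.0, 50.0, 5.0)
print(round(n_below, 1), round(n_above, 1), round(height, 2))
```

Here X = 52 corresponds to x = .40, so the number below X is 1000 times the area tabulated for that abscissa.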


Method of Using. 


2. Interpolation. If a quantity $u$ be tabulated in terms of $x$, and $u_0$ be the
value of $u$ for $x = x_0$, then its value for $x = x_0 + \theta h$, where $h$ is the common
difference of $x$, and $\theta$ is less than 1, is usually found from $u_0$ and its advancing
differences by the formula

$$ u = u_0 + \theta\,\Delta u_0 - \frac{\theta(1-\theta)}{2!}\,\Delta^2 u_0 + \frac{\theta(1-\theta)(2-\theta)}{3!}\,\Delta^3 u_0 - \dots \tag{3} $$

An alternative formula, in which central differences are used, is that recently
given* by Professor Everett. When we do not require to go beyond third
differences, this formula is most conveniently written

$$ u = u_0 + \theta\,\Delta u_0 - \frac{\theta(1-\theta^2)}{3!}\,\delta^2 u_1 - \frac{\phi(1-\phi^2)}{3!}\,\delta^2 u_0 \tag{4} $$

where $\delta^2 u_0$ and $\delta^2 u_1$ are the second central differences of $u_0$ and $u_1$ (i.e. $\delta^2 u_0 = \Delta^2 u_{-1}$,
$\delta^2 u_1 = \Delta^2 u_0$), and $\phi = 1 - \theta$.
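The advancing-difference formula (3) and the central-difference formula (4) may be compared on the function of Table I. In the sketch below the tabular values are generated from the standard-library error function instead of being copied from the Tables, with $h = .01$ and $x_0 = .40$; the names are illustrative only.

```python
import math


def area_below(x):
    """Area (1 + alpha)/2 of the normal curve below x."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


H = 0.01
X0 = 0.40
# Tabular values u[k] at x0 + k*h for k = -1, 0, 1, 2, 3.
u = {k: area_below(X0 + k * H) for k in range(-1, 4)}


def advancing(u, theta):
    """Formula (3), carried to third advancing differences."""
    d1 = u[1] - u[0]
    d2 = u[2] - 2 * u[1] + u[0]
    d3 = u[3] - 3 * u[2] + 3 * u[1] - u[0]
    return (u[0] + theta * d1
            - theta * (1 - theta) / 2 * d2
            + theta * (1 - theta) * (2 - theta) / 6 * d3)


def everett(u, theta):
    """Formula (4), using the second central differences of u0 and u1."""
    phi = 1 - theta
    d1 = u[1] - u[0]
    c2_u0 = u[1] - 2 * u[0] + u[-1]
    c2_u1 = u[2] - 2 * u[1] + u[0]
    return (u[0] + theta * d1
            - theta * (1 - theta ** 2) / 6 * c2_u1
            - phi * (1 - phi ** 2) / 6 * c2_u0)


theta = 0.5
exact = area_below(X0 + theta * H)
print(abs(advancing(u, theta) - exact), abs(everett(u, theta) - exact))
```

With this interval both formulae agree with the directly computed value far within the seventh decimal place; the central-difference form needs only one tabular value on either side of the pair $u_0$, $u_1$.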


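In practice, when second and higher differences have to be considered, and a
mechanical method of performing multiplications is available (e.g. by means of a
Brunsviga), it is more convenient to use differential coefficients. Let the means
of the pairs of odd differences above and below the line of $u_0$ be denoted by
$\mu\delta u_0$, $\mu\delta^3 u_0$, ... (i.e. $\mu\delta u_0 = \tfrac{1}{2}(\Delta u_0 + \Delta u_{-1})$, $\mu\delta^3 u_0 = \tfrac{1}{2}(\Delta^3 u_{-1} + \Delta^3 u_{-2})$, ...), and let
$Du_0$, $D^2u_0$, $D^3u_0$, ... be the values for $x = x_0$ of the successive differential coefficients
of $u$ with regard to $x$; then†, $\delta^2$ having the same meaning as in (4),

$$ \begin{aligned}
h\,Du_0 &= \mu\delta u_0 - \tfrac{1}{6}\,\mu\delta^3 u_0 + \tfrac{1}{30}\,\mu\delta^5 u_0 - \dots \\
h^2 D^2 u_0 &= \delta^2 u_0 - \tfrac{1}{12}\,\delta^4 u_0 + \dots \\
h^3 D^3 u_0 &= \mu\delta^3 u_0 - \tfrac{1}{4}\,\mu\delta^5 u_0 + \dots \\
h^4 D^4 u_0 &= \delta^4 u_0 - \dots \\
&\text{etc.}
\end{aligned} \tag{5} $$

and, for $x = x_0 + \theta h$,

$$ u = u_0 + \theta\left[hDu_0 + \tfrac{1}{2}\theta\left\{h^2D^2u_0 + \tfrac{1}{3}\theta\left(h^3D^3u_0 + \dots\right)\right\}\right] \tag{6} $$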


* Journal of the Institute of Actuaries, Vol. xxxv. p. 452.

† Proc. of the Lond. Math. Soc. Vol. xxx. p. 465.

We need only use inside the square brackets the numerical values of $\theta$ and of
$hDu_0$, $h^2D^2u_0$, ..., if we put the sign of $\theta \cdot hDu_0$ outside, and follow the rule of
signs inside. The rule is that a minus sign must be inserted, if $\theta$ be positive, whenever
there is a change of sign in the series $hDu_0$, $h^2D^2u_0$, ..., or, if $\theta$ be negative,
whenever there is no change of sign in this series.


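Thus, in Table I., taking $u$ to be $\tfrac{1}{2}(1+\alpha)$, we have for $x_0 = .40$ (omitting
decimal point)

$$\mu\delta u_0 = 36826\tfrac{1}{2}, \qquad \delta^2 u_0 = -147, \qquad \mu\delta^3 u_0 = -3,$$
$$hDu_0 = 36827, \qquad h^2D^2u_0 = -147, \qquad h^3D^3u_0 = -3,$$

and therefore, for $x = .40 + .01\,\theta$,

$$\tfrac{1}{2}(1+\alpha) = 10^{-7}\,(6554217 + \theta\,[36827 - \tfrac{1}{2}\theta\,\{147 + \tfrac{1}{3}\theta \cdot 3\}]).$$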


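For $x = .40 - .01\,\theta$ we should replace $+\theta$, $-\tfrac{1}{2}\theta$, $+\tfrac{1}{3}\theta$ by $-\theta$, $+\tfrac{1}{2}\theta$, $-\tfrac{1}{3}\theta$. The
formulæ given by (3) and (4) would be, for $x = .40 + .01\,\theta$,

$$\tfrac{1}{2}(1+\alpha) = 10^{-7}\left(6554217 + 36753\,\theta + 150\,\frac{\theta(1-\theta)}{2!} - 3\,\frac{\theta(1-\theta)(2-\theta)}{3!}\right),$$

$$\tfrac{1}{2}(1+\alpha) = 10^{-7}\left(6554217 + 36753\,\theta + 150\,\frac{\theta(1-\theta^2)}{3!} + 147\,\frac{\phi(1-\phi^2)}{3!}\right).$$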


There is no difficulty about the divisions represented by the coefficients
$\tfrac{1}{2}$, $\tfrac{1}{3}$, ... of $\theta$ in (6); but, if we wish to avoid them, we may calculate $h^2D^2u_0/2!$,
$h^3D^3u_0/3!$, ... and write the formula

$$u = u_0 + \theta\,[hDu_0 + \theta\,\{\tfrac{1}{2}h^2D^2u_0 + \theta\,(\tfrac{1}{6}h^3D^3u_0 + \dots)\}] \qquad (7).$$
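The nested form (6) lends itself to evaluation from the innermost bracket outwards. The sketch below does this for the worked example at $x_0 = .40$; the function name `nested_interp` is an assumption, the signed values 36827, −147, −3 are those quoted above, and the comparison figures come from `statistics.NormalDist`, not from the paper. Passing a negative θ reproduces the change of signs described for $x = .40 - .01\,\theta$.

```python
from statistics import NormalDist


def nested_interp(u0, scaled_derivs, theta):
    """Evaluate u0 + t[hDu0 + (t/2){h^2 D^2 u0 + (t/3)(h^3 D^3 u0 + ...)}].

    scaled_derivs holds hDu0, h^2 D^2 u0, h^3 D^3 u0, ... with their signs.
    """
    acc = 0.0
    # Work outwards from the innermost bracket: the k-th term carries theta/k.
    for k in range(len(scaled_derivs), 0, -1):
        acc = (theta / k) * (scaled_derivs[k - 1] + acc)
    return u0 + acc


derivs = [36827, -147, -3]                     # values quoted for x0 = .40
above = nested_interp(6554217, derivs, 0.5)    # x = .405
below = nested_interp(6554217, derivs, -0.5)   # x = .395
print(above, NormalDist().cdf(0.405) * 1e7)
print(below, NormalDist().cdf(0.395) * 1e7)
```

The loop is ordinary Horner evaluation of the Taylor series, so no explicit factorials are needed.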




3. Inverse Interpolation. In most cases $N_1$ and $N_2$ [$= \tfrac{1}{2}N(1+\alpha)$ and $\tfrac{1}{2}N(1-\alpha)$]
are known, and we require $x$. If $\tfrac{1}{2}(1+\alpha)$ and $\tfrac{1}{2}(1-\alpha)$ are both less than .90, we
can use Table III., which gives $x$ in terms of $\alpha = (N_1 - N_2)/N$. But, if either
$\tfrac{1}{2}(1+\alpha)$ or $\tfrac{1}{2}(1-\alpha)$ be greater than .90, we must use Table I. by inverse
interpolation. By (6) we have, if $x = x_0 + \theta h$,

$$\theta = (u - u_0) \div [hDu_0 + \tfrac{1}{2}\theta\,\{h^2D^2u_0 + \tfrac{1}{3}\theta\,(h^3D^3u_0 + \dots)\}] \qquad (8),$$

or, if $x = x_0 - \theta h$,

$$\theta = (u_0 - u) \div [hDu_0 - \tfrac{1}{2}\theta\,\{h^2D^2u_0 - \tfrac{1}{3}\theta\,(h^3D^3u_0 - \dots)\}] \qquad (8\text{A}).$$

The value of $\theta$, and thence that of $x$, is obtained by successive approximations.


Suppose, for instance, that
$$\tfrac{1}{2}(1+\alpha) = .654.$$
If $x = .40 - .01\,\theta$, we have from Table I.
$$\theta = 14217 \div [36827 + \tfrac{1}{2}\theta\,(147 - \theta)].$$
A first approximation gives
$$\theta = 14217 \div 36827 = .38605;$$
and, with this value, the corrected divisor becomes
36855.2,
which gives for a second approximation
$\theta = .38575$,
and therefore
$x = .3961425$.
The correct value, as given by Table III., is
$x = .3961424$.
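The successive approximations of (8A) can be written as a simple fixed-point iteration. This is a minimal sketch under the same data as the example above; the function name `inverse_theta`, the tolerance, and the iteration limit are assumptions, and the reference value is computed with `statistics.NormalDist().inv_cdf` in place of Table III.

```python
from statistics import NormalDist


def inverse_theta(target, u0, hD, h2D2, h3D3, tol=1e-10, max_iter=50):
    """Solve (8A) for theta by successive approximation, where x = x0 - theta*h."""
    theta = 0.0
    for _ in range(max_iter):
        # Divisor of (8A), evaluated with the current approximation to theta.
        divisor = hD - 0.5 * theta * (h2D2 - theta / 3.0 * h3D3)
        improved = (u0 - target) / divisor
        if abs(improved - theta) < tol:
            return improved
        theta = improved
    return theta


# 1/2 (1 + alpha) = .654, starting from the Table I. entry at x0 = .40.
theta = inverse_theta(target=6540000, u0=6554217, hD=36827, h2D2=-147, h3D3=-3)
x = 0.40 - 0.01 * theta
reference = NormalDist().inv_cdf(0.654)
print(theta, x, reference)
```

The first pass through the loop gives the first approximation 14217 ÷ 36827, and the second already agrees with the hand calculation to the figures shown.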

The degree of accuracy with which $x$ can be obtained by this method depends on
the relation of magnitude between the differences of $x$ and of $u$. In the above
example, to a difference of .01 in $x$ there corresponds a difference of very little
more than one-third of .01 in $\tfrac{1}{2}(1+\alpha)$; and therefore, if $x$ is calculated from
Table I. to seven places of decimals, it will only be accurate within about 2 in the
last figure. The possible inaccuracy of $x$ increases as $\tfrac{1}{2}(1+\alpha)$ increases. But this
is not important, as the "probable error" of $x$, for any given number of observations,
also increases.

4. Smoothing. In arranging a table, with differences, for the calculation of any quantity $u$,
it is usual to enter in the difference-columns the actual (or "tabular") differences of the values
of $u$ as tabulated. In the present tables I have adopted a different method, and have given the
differences as near as possible to the differences of the true values of $u$. The object of this is to
enable greater accuracy to be obtained when required. If we only want $u$ to five or six places
of decimals, it is immaterial whether we use the tabular or the corrected differences. But, if we
wish to have it as accurate as possible, we can alter the tabulated values by inspection.

Looking, for instance, at the commencement of Table I., it is clear that the tabulated values
of $\tfrac{1}{2}(1+\alpha)$ are too great for $x = .01$ and $x = .03$, while they are too small for $x = .02$ and $x = .04$.
Taking $u = \tfrac{1}{2}(1+\alpha) \times 10^7$, so as to omit the decimal point, the table may be written

  x      u              Δ                    Δ²
 .00    5000000        39894 − θ            0
 .01    5039894 − θ    39890 − (1 − θ − φ)  4 + (1 − 2θ − φ)
 .02    5079783 + φ    39882 − (φ + χ)      8 − (1 − θ − 2φ − χ)
 .03    5119665 − χ    39870 − (1 − χ − ψ)  12 + (1 − φ − 2χ − ψ)
 .04    5159534 + ψ

By tabulating $u$ by differences of .05 or .10 in $x$, it will be found* that the third difference in
Table I., for these values of $x$, is almost exactly 4. We see therefore that θ + φ and χ + ψ
are both greater than ½, and φ + χ is less than ½; while θ, 1 − θ − φ, φ + χ, and 1 − χ − ψ are
all very nearly equal. The values θ = .4, φ = .2, χ = .2, ψ = .4 satisfy these conditions; and, as a
matter of fact, they give for $\tfrac{1}{2}(1+\alpha)$ values which are correct within $1 \times 10^{-8}$.
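The closing claim of § 4 can be verified numerically. The sketch below applies the corrections θ = .4, φ = .2, χ = .2, ψ = .4 to the first four entries of Table I. and compares the result with values computed by `statistics.NormalDist`; the dictionary names are assumptions, and the computed values are a modern stand-in, not part of the paper.

```python
from statistics import NormalDist

# Entries of Table I. with the decimal point omitted (units of the 7th decimal).
tabulated = {0.01: 5039894, 0.02: 5079783, 0.03: 5119665, 0.04: 5159534}
# Signed corrections -theta, +phi, -chi, +psi in the same units.
corrections = {0.01: -0.4, 0.02: +0.2, 0.03: -0.2, 0.04: +0.4}

raw_errors = {}
corrected_errors = {}
for x, entry in tabulated.items():
    exact = NormalDist().cdf(x) * 1e7
    raw_errors[x] = entry - exact                      # error of the printed entry
    corrected_errors[x] = entry + corrections[x] - exact

print(raw_errors)
print(corrected_errors)
```

A corrected error below 0.1 in these units corresponds to the stated accuracy of $1 \times 10^{-8}$, and the signs of the raw errors show which printed entries are too great and which too small.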

* For the relations between differences of u for large and small differences in x, see Proc. Lond.
Math. Soc. Vol. xxxi. pp. 468–471.

Examples of Application.


5. Expression of data in terms of x. When a distribution is nearly normal,
we may state the data by expressing X (see § 1) in terms of $x$. For an example,
take the head-breadths of 3000 criminals, given on p. 214 of Biometrika, Vol. I.
The interval in X is .1 of a centimetre: but, for brevity, we shall take intervals of
.3 of a centimetre.

We should first note the probable errors. If, of the N values, $N_1$ lie below X
and $N_2$ above it, the probable error in $N_1$ or $N_2$ is $\pm Q\sqrt{N_1N_2/N}$, where $Q = .67449$;
i.e. it is an even chance that N times the true proportion of values below X lies
between $N_1 - Q\sqrt{N_1N_2/N}$ and $N_1 + Q\sqrt{N_1N_2/N}$. Thus, for X = 14.55 cm., the
P.E. is $\pm Q\sqrt{479 \times 2521 \div 3000} = 13.5$. Calculating the probable errors, the data
may be expressed thus:


   X       N_1     N_2     P.E.
 13.35       0    3000      -
 13.65       8    2992    ± 1.9
 13.95      38    2962    ± 4.1
 14.25     157    2843    ± 8.2
 14.55     479    2521    ±13.5
 14.85    1077    1923    ±17.7
 15.15    1762    1238    ±18.2
 15.45    2364     636    ±15.1
 15.75    2763     237    ±10.0
 16.05    2925      75    ± 5.8
 16.35    2978      22    ± 3.2
 16.65    2997       3    ± 1.2
 16.95    3000       0      -





Now calculate the values of $x$ and of $z$ corresponding to $N_1 = \tfrac{1}{2}N(1+\alpha)$. An
error of $\theta$ in $x$ is equivalent to an error of $z\theta$ in $\tfrac{1}{2}(1+\alpha)$, so that the above values
of the P.E. have to be divided by 3000z to give the P.E. in $x$. We thus get the
data in the form:





   X        x        P.E.
 13.35    − ∞         -
 13.65      …       ±.077
 13.95    −2.…      ±.042
 14.25    − …       ±.020
 14.55    − .996    ±.019
 14.85    − .361    ±.016
 15.15    + .221    ±.016
 15.45    + .800    ±.017
 15.75    +1.412    ±.023
 16.05    +1.960    ±.033
 16.35    +2.441    ±.052
 16.65    +3.090    ±.135
 16.95    + ∞         -


This, it should be observed, is merely a statement of facts, and does not involve 
any assumption as to the distribution being really normal. 
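The two steps of § 5, the probable error of a count and its conversion into a probable error in $x$, can be sketched as follows. The function names are assumptions, and `statistics.NormalDist` is used where the paper uses Tables II. and III.; the check uses the X = 14.55 cm. row of the head-breadth data.

```python
from math import sqrt
from statistics import NormalDist

Q = 0.67449  # ratio of probable error to standard deviation


def probable_error_count(N1, N2):
    """Probable error of N1 (or N2): Q * sqrt(N1 * N2 / N)."""
    return Q * sqrt(N1 * N2 / (N1 + N2))


def abscissa_with_pe(N1, N2):
    """Return x corresponding to N1/N and its probable error, P.E./(N z).

    Needs 0 < N1 < N; the end rows of the table give x = -inf or +inf.
    """
    N = N1 + N2
    std = NormalDist()
    x = std.inv_cdf(N1 / N)
    z = std.pdf(x)
    return x, probable_error_count(N1, N2) / (N * z)


pe_count = probable_error_count(479, 2521)   # row X = 14.55
x, pe_x = abscissa_with_pe(479, 2521)
print(round(pe_count, 1), round(x, 3), round(pe_x, 3))
```

Dividing by N·z is the step described in the text: an error θ in $x$ changes the area by zθ, and hence the count by Nzθ.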


6. Interpolation*. By means of these values of $x$, we can interpolate for
values of X lying towards the extremities of the range, where the differences


* For a fuller discussion of the methods employed in this and the next two sections, see Journal
of the Royal Statistical Society, Vol. x1. pp. 433–451.




















W. F. SHEPPARD 179


of $N_1$ or $N_2$ are usually irregular. Thus, calculating the values of $x$ for X = 14.05
and 14.15 by means of the first difference alone, and thence calculating $\tfrac{1}{2}(1+\alpha)$
from Table I. (remembering that $x$ and $\alpha$ are negative), we get the following
results, as compared with the actual observations:


                                            ½N(1+α)
   X        x       ½(1−α)   ½(1+α)   Calculated value   Actual value
 14.05   −2.032     .9789    .0211          63                61
 14.15   −1.829     .9663    .0337         101                97


It is quite possible that the discrepancy between the calculated and the actual
values is mainly due to the errors of random selection of the 118 individuals lying
between X = 13.95 and X = 14.25.

7. Certain Special Cases. The method is especially useful (a) where the
differences in X are irregular or are large in comparison with the standard
deviation, and (b) in dealing with the "arrays" in cases of normal or nearly
normal correlation. As an example of the former, Prof. Pearson has provided
me with the following results obtained by Miss C. D. Fawcett.


Mottling of Mimulus Luteus.


X = Number of splotches    Less than 50    50 to 61    62 to 71    More than 71    Total
Number of individuals           18            43          87           56           204

This gives three values of $x$ in terms of X, viz:

X        49½               61½              71½
x    −1.352 ± .084     −.527 ± .062     +.599 ± .063

The differences in $x$ are .825 and 1.126, whereas the differences in X are in the
ratio of 6 : 5. Having regard to the probable errors, it is very doubtful whether
the distribution can be treated as normal. If it can be, the true values of $x$ may
be somewhat as follows:


X                               49½       61½       71½
x (corrected)                 −1.503     −.417     +.488
Ratio of correction to P.E.    −1.8      +1.8      −1.8


These would give a mean of 66.61, and a standard deviation of 11.05, the
probable errors being respectively ± .52 and ± .37.
This, of course, is guesswork. If there are more than three or four values of $x$
to be dealt with, we can do our guessing by graphic methods. The values
of $x$ should be plotted as ordinates, with a mark on each side to shew the probable
error: and we have then to draw a straight line which shall make the error
as small as possible, allowance being made for the different values of the probable
error in the different ordinates so plotted. It must be remembered that the
errors are not independent, but correlated: the correlation between the errors in
any pair of values of $x$ being positive.


If we want to proceed more rigidly, we must use the actual values of $x$,
and take account of differences. This is practically equivalent to regarding X as
an unknown function of some other quantity Y, whose values are normally distributed;
the relation between X and Y being such that dX/dY is always positive.
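The figures of the Mimulus example in this section can be retraced with a few lines. The sketch below converts the cumulative counts 18, 61, 148 out of 204 into values of $x$ with their probable errors, and then derives the standard deviation implied by the corrected values as the ratio of the difference in X to the difference in $x$. Variable names are assumptions, and `statistics.NormalDist` replaces the printed tables.

```python
from math import sqrt
from statistics import NormalDist

Q = 0.67449
std = NormalDist()
total = 204
cumulative = [18, 18 + 43, 18 + 43 + 87]   # individuals below 49.5, 61.5, 71.5

observed = []
for n1 in cumulative:
    x = std.inv_cdf(n1 / total)
    pe = Q * sqrt(n1 * (total - n1) / total) / (total * std.pdf(x))
    observed.append((x, pe))

# Standard deviation implied by the corrected values of x quoted in the text.
limits = [49.5, 61.5, 71.5]
corrected = [-1.503, -0.417, 0.488]
sd_lower = (limits[1] - limits[0]) / (corrected[1] - corrected[0])
sd_upper = (limits[2] - limits[1]) / (corrected[2] - corrected[1])
print([(round(x, 3), round(pe, 3)) for x, pe in observed])
print(round(sd_lower, 2), round(sd_upper, 2))
```

That the two intervals give the same standard deviation is simply the condition that the corrected points lie on one straight line.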


8. Calculation of ordinates. The ordinate Z of the curve of frequency is $dN_1/dX$. In most
cases this can be calculated directly from the data, by the first formula of (5) (taking $u = N_1$) or
a similar formula. Where the differences in $u$ are too irregular for this, but the differences in $x$
are comparatively regular, we can use the data in the form shewn in § 5: we have then

$$Z = N \cdot \frac{d\,\tfrac{1}{2}(1+\alpha)}{dx} \cdot \frac{dx}{dX} = Nz \cdot \frac{dx}{dX}.$$

The values of $dx/dX$ are given by the data, and the values of $z$ are found from Table II.


9. Testing for normal distribution. This is the purpose for which tables 
such as Table I. are most frequently employed. It is not necessary to give 
any examples here. 


10. Calculation of correlation-volumes. The formula, given by Professor
Pearson*, for calculating the double-integral expressing normal correlation, involves
a factor

$$\frac{1}{2\pi}\,e^{-\frac{1}{2}(x^2+y^2)},$$

which is the product of the values of $z$ as given by Table II. for $x$ and for $y$
respectively, and therefore is easily found from that Table.


Construction of the Tables. 


11. The tabulated values are all given to seven decimal places (and, in the latter part of
Tables I. and II., to ten decimal places). They were originally calculated to two or three more
places, the final figures being then corrected. Where the final figure was doubtful, the value
was calculated specially. The differences are taken from the larger table, the last figure being
corrected: but doubtful values were not specially calculated.


For constructing Tables I. and II. up to $x = 2.50$, the values of $z$ were found for the intermediate
values .005, .015, .025 ... of $x$ by successive multiplication; each tenth value being
checked by Newman's table† of $e^{-x}$. The differences being then taken, the values of $\tfrac{1}{2}(1+\alpha)$ for


* Phil. Trans. series A, Vol. 195, pp. 1–47.
† Camb. Phil. Soc. Trans. Vol. xiii. Pt. 3.

$x = .01$, $.02$ ... were obtained by quadrature*, and those of $z$ by interpolation. For the remainder
of the two tables, the values of $z$ were found from those of $\log_{10} z$, which are easily calculated, and
thence the values of $\tfrac{1}{2}(1+\alpha)$ were obtained by quadrature. For checking the table of $\tfrac{1}{2}(1+\alpha)$,
values were directly calculated at intervals†: and both tables were further checked by the
calculations required where a final figure was doubtful.


For constructing Tables III. and IV., the values were first obtained approximately to seven
places: and the tables were then extended, by a method explained elsewhere‡. The extension gave
z to nine and to eleven places. The table was checked by direct calculation for α = .1, .2, .3 ....


* See Proc. of Lond. Math. Soc. Vol. xxxi. pp. 479–482.

† Some use was also made of Burgess's tables (Trans. Roy. Soc. Edin., Vol. xxxix., Pt. 2, No. 9), in
which α is given (to a large number of figures) in terms of $t = x/\sqrt{2}$. But they were only used incidentally
and the two sets of tables may be regarded as independently calculated.

‡ Proc. of Lond. Math. Soc. Vol. xxx. pp. 423, 439.


TABLES I. and II. Area and Ordinate in terms of Abscissa.


Note.


For values of the abscissa $x$ from .00 to 4.50, the values of the area $\tfrac{1}{2}(1+\alpha)$
and of the ordinate $z$ are given to 7 decimal places (pp. 182-7). For values of $x$
from 4.50 to 6.00, the values of $\tfrac{1}{2}(1+\alpha)$ and of $z$ are given to ten decimal places
(p. 188), but the initial figures are omitted. Hence, in using this latter portion
of the tables the figures in the column for $\tfrac{1}{2}(1+\alpha)$ must have 99999 prefixed,
and those in the column for $z$ must have sufficient zeros prefixed to bring up
the total of decimal figures to ten. For example, against $x = 5.75$ we have 99955
and 264, but we must read $\tfrac{1}{2}(1+\alpha) = .9999999955$ and $z = .0000000264$.














182 


  x     ½(1+α)     Δ (+)      z         Δ (−)
 .00   .5000000    39894   .3989423      199
 .01   .5039894    39890   .3989223      598
 .02   .5079783    39882   .3988625      997
 .03   .5119665    39870   .3987628     1395
 .04   .5159534    39854   .3986233     1793
 .05   .5199388    39834   .3984439     2191
 .06   .5239222    39810   .3982248     2588
 .07   .5279032    39782   .3979661     2984
 .08   .5318814    39750   .3976677     3379
 .09   .5358564    39714   .3973298     3773
 .10   .5398278    39675   .3969525     4166
 .11   .5437953    39631   .3965360     4558
 .12   .5477584    39584   .3960802     4948
 .13   .5517168    39532   .3955854     5337
 .14   .5556700    39477   .3950517     5724
 .15   .5596177    39418   .3944793     6110
 .16   .5635595    39355   .3938684     6493
 .17   .5674949    39288   .3932190     6875
 .18   .5714237    39217   .3925315     7255
 .19   .5753454    39143   .3918060     7633
 .20   .5792597    39065   .3910427     8008
 .21   .5831662    38983   .3902419     8381
 .22   .5870644    38897   .3894038     8752
 .23   .5909541    38808   .3885286     9120
 .24   .5948349    38715   .3876166     9485
 .25   .5987063    38618   .3866681     9847
 .26   .6025681    38518   .3856834    10207
 .27   .6064199    38414   .3846627    10564
 .28   .6102612    38306   .3836063    10917
 .29   .6140919    38195   .3825146    11268
 .30   .6179114    38081   .3813878    11615
 .31   .6217195    37963   .3802264    11958
 .32   .6255158    37842   .3790305    12298
 .33   .6293000    37717   .3778007    12635
 .34   .6330717    37589   .3765372    12968
 .35   .6368307    37458   .3752403    13297
 .36   .6405764    37323   .3739106    13623
 .37   .6443088    37185   .3725483    13944
 .38   .6480273    37044   .3711539    14262
 .39   .6517317    36900   .3697277    14575
 .40   .6554217    36753   .3682701    14885
 .41   .6590970    36602   .3667817    15190
 .42   .6627573    36449   .3652627    15491
 .43   .6664022    36293   .3637136    15787
 .44   .6700314    36133   .3621349    16079
 .45   .6736448    35971   .3605270    16367
 .46   .6772419    35806   .3588903    16650
 .47   .6808225    35638   .3572253    16928
 .48   .6843863    35467   .3555325    17202
 .49   .6879331    35294   .3538124    17470
 .50   .6914625            .3520653


  x     ½(1+α)     Δ (+)
 .50   .6914625    35118
 .51   .6949743    34939
 .52   .6984682    34758
 .53   .7019440    34574
 .54   .7054015    34388
 .55   .7088403    34200
 .56   .7122603    34009
 .57   .7156612    33815
 .58   .7190427    33620
 .59   .7224047    33422
 .60   .7257469    33222
 .61   .7290691    33020
 .62   .7323711    32816
 .63   .7356527    32610
 .64   .7389137    32402
 .65   .7421539    32192
 .66   .7453731    31980
 .67   .7485711    31767
 .68   .7517478    31551
 .69   .7549029    31334
 .70   .7580363    31116
 .71   .7611479    30896
 .72   .7642375    30674
 .73   .7673049    30451
 .74   .7703500    30226
 .75   .7733726    30001
 .76   .7763727    29773
 .77   .7793501    29545
 .78   .7823046    29316
 .79   .7852361    29085
 .80   .7881446    28853
 .81   .7910299    28620
 .82   .7938919    28387
 .83   .7967306    28152
 .84   .7995458    27917
 .85   .8023375    27680
 .86   .8051055    27443
 .87   .8078498    27205
 .88   .8105703    26967
 .89   .8132671    26728
 .90   .8159399    26489
 .91   .8185887    26249
 .92   .8212136    26008
 .93   .8238145    25768
 .94   .8263912    25527
 .95   .8289439    25285
 .96   .8314724    25044
 .97   .8339768    24802
 .98   .8364569    24560
 .99   .8389129    24318
1.00   .8413447


TABLES I. and II. (continued).

2 snliip ohana ae 
} | . | a2 | 
A A a - ) «£ A? . a | | 
z os r= £ 5 (1+a) + | be e | + | 
oe Se oS ea bem 
5 $5 > | . *Q ¢ wd | | 9AS “S 707 0 
3520653 | |... 264 1:00 } 8413447 | oyyrg | 242 | 2419707 | oyigg | 0 
3502919 | 17734 | o59 | 101 | -8437524 ne | 242 | -2395511 | S419) | 5 
3ags925 | 17994 | O54 1°02 | 8461358 | S359) | 242 | -2371320 | Siig | 10 
3466677 | 18248 | 949 1-03 | -8484950 | 5555; | 242 | 2347138 | Sigg | 14 
3448180 | ipo, | 244 1°04 | *8508300 | S399 | 242 | -2322970 | Sirs | 19 
"3429439 | 18981 239 1°05 "8531409 29868 241 "2298821 | O4j0, | 2 
3410458 |. 215 234 1-06 8554277 22626 241 "2274696 24097 | 28 
3391243 | 19215 229 1°07 8576903 29286 241 "2250599 | 34064 | 33 
3371799 | 19444 | 994 1-08 | -8599289 | 55145 | 240 | 2226535 | Sapo, | 37 
3352132 | jogo, | 219 109 } "8621434 | dio, | 240 | 2202508 | Sang, | 41 
3332246 | soon | 213 1:10 | 8643339 | S1¢g; | 240 | 2178522 | Soy, | 46 
Q« = | 5 *] ° 202 - 92 "2154582 sie | 50 
3312147 | ony 208 I-11 8665005 21496 239 2 2 2389) 
3291840 | 20307 | 903 12 | -ses6a31 | 312 | 239 | -2130691 i} % 
3271330 | 20510 | j97 | 1:13 | 8707619 | So959 | 238 | 2106856 | 53030 58 
3250623 | 3% nal 192 | 1:14 | 8728568 | So-15 | 237 | -2083078 rich 62 
rygu aco }. 20895 ous e oh ati } 20712 asad 2059363 | 7341 | -* 
3229724 | Siogg | 18% | 1:15 | -8749281 ) 20475. | 237 2059363 | 95639 | 68 
3208638 | scales 181 | 1:16 “8769756 | 20939 | 236 "2035714 23578 70 
3187371 | 21267 | 176 1:17 | 8789995 | S599, | 235 | 2012135 | Sov | 74 
*3165929 21 : s 170 1°18 “8809999 19769 235 “1988631 23496 | 18 | 
TABLES III. and IV. Abscissa and Ordinate in terms of difference of Areas

New Tables of the Probability Integral


VARIATION IN "EUPAGURUS PRIDEAUXI"
(HELLER).


By E. H. J. SCHUSTER, B.A. 


I. 


Introduction. The work on which the following paper was based was done in 
the winter of 1901—1902 during my occupancy of the Oxford University table at 
the Naples Biological Station; and I take this opportunity of expressing my 
appreciation of the kindness and courtesy of those members of the staff of that 
institution with whom I came into contact. 

The subsequent calculations were done under the direction of Professor Weldon, 
and I herewith tender him my sincere thanks for devoting so much time and 
trouble to the purpose. I have also to thank Professor Pearson for his numerous 
and valuable suggestions. 


The work itself is an attempt to determine whether members of the species 
Eupagurus prideauxi caught in shallow water, differ with regard to certain charac- 
ters from those caught in comparatively deep water. 

For this purpose the following three measurements were taken : 

1. From the upper articulation of the propodite with the carpopodite of the
right chela to the upper and outer articulation of the propodite with the dactylopodite,
called Measurement No. 1. Fig. 1, AB.

2. The length of the right chela measured from its lower articulation with
the carpopodite, to the furthest point of the fixed blade of the scissors. Fig. 1, CD.

3. The length of the carapace along the median line.
These measurements were taken with a pair of dividers provided with a screw 


fine adjustment and an ivory scale divided into 4 millimetres, they are believed to 
be accurate to ;/; of a millimetre. 


From Measurements No. 1 and No. 2 an index called the chela index was 
deduced, which is intended to represent the proportion which the claw bears to 
the whole length of the chela. 


$$\text{The chela index} = \frac{\text{Measurement No. 2} - \text{Measurement No. 1}}{\text{Measurement No. 2}}$$





About two thousand individuals were measured, which were separated into two 
main groups : 


I. Shallow water forms, from a depth of 35 metres or under ; 
II. Deep water forms, from a depth of over 35 metres. 
Owing to the great difference in size between the sexes each of these groups
was again subdivided into male and female. Thus finally we have to deal with
four sets with about five hundred individuals in each.


II.


Comparison between deep water forms and shallow water forms as regards each 
measurement and the chela index taken separately. The deep water males are in 
each case compared with the shallow water males and the deep water females with 
the shallow water females. 


TABLE I.

Measurement No. 1. ♂.

                        Mean           Probable error of mean    Standard Deviation
Deep water forms         9.7078 mm.         .0847 mm.               2.7588 mm.
Shallow water forms     10.2718 mm.         .0745 mm.               2.5901 mm.
Difference               0.5640 mm.    Probable error of difference = .1128 mm.

This measurement is greater in the shallow water forms by an amount exactly
five times as great as its probable error, and therefore almost certainly significant.
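The probable error of the difference in Table I. agrees with the root of the sum of the squares of the two probable errors of the means. The paper does not state this rule in words, so it is assumed here because it reproduces the printed figure; the function name is likewise an assumption.

```python
from math import hypot


def pe_of_difference(pe_a, pe_b):
    """Probable error of a difference of two independent means."""
    return hypot(pe_a, pe_b)   # sqrt(pe_a**2 + pe_b**2)


# Table I.: Measurement No. 1, males.
difference = 10.2718 - 9.7078
pe_diff = pe_of_difference(0.0847, 0.0745)
ratio = difference / pe_diff
print(round(difference, 4), round(pe_diff, 4), round(ratio, 2))
```

The same two lines applied to the other tables of this section give the ratios of difference to probable error on which the judgments of significance rest.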
TABLE II.

Measurement No. 1. ♀.

                        Mean           Probable error of mean    Standard Deviation
Deep water forms         7.4000 mm.         .0330 mm.               1.0607 mm.
Shallow water forms      7.4850 mm.         .0293 mm.               1.0234 mm.
Difference               0.0850 mm.    Probable error of difference = .0441 mm.

This measurement is greater in the shallow water forms by an amount about
twice as great as its probable error.
TABLE III.

Right Chela Length. ♂.

                        Mean           Probable error of mean    Standard Deviation
Deep water forms        17.9676 mm.         .1451 mm.               4.7279 mm.
Shallow water forms     18.6773 mm.         .1259 mm.               4.3769 mm.
Difference                .7097 mm.    Probable error of difference = .1922 mm.

This measurement is greater in the shallow water forms by an amount between
three and four times as great as its probable error, and therefore is probably
significant.


TABLE IV.

Right Chela Length. ♀.

                        Mean           Probable error of mean    Standard Deviation
Deep water forms        14.1415 mm.         .0612 mm.               1.9679 mm.
Shallow water forms     13.9735 mm.         .0522 mm.               1.8215 mm.
Difference                .1680 mm.    Probable error of difference = .0805 mm.

This measurement is greater in the deep water forms by an amount twice as
great as its probable error.








To sum up with regard to the absolute measurements of the right chela; in 
the male they are both significantly greater in the shallow water than in the deep 
water forms; while in the female Measurement No. 1 is greater in the shallow 
water forms, Measurement No. 2 in the deep water forms, and in neither case is 
the difference, though well marked, sufficiently great to be called significant. 

TABLE V.

Chela Index. ♂.

                        Mean        Probable error of mean    Standard Deviation
Deep water forms        .461348          .000538                  .017537
Shallow water forms     .451045          .000472                  .016417
Difference              .010303     Probable error of difference = .000715

The chela index is greater in the deep water forms by an amount more than
13 times as big as its probable error, and therefore certainly significant.


TABLE VI.

Chela Index. ♀.

                        Mean        Probable error of mean    Standard Deviation
Deep water forms        .475051          .000458                  .014721
Shallow water forms     .471312          .000385                  .013429
Difference              .003739     Probable error of difference = .000598

The chela index is greater in the deep water forms by an amount more than
six times as big as its probable error, and therefore almost certainly significant.

Thus Tables V. and VI. show that in the forms taken from deep water, both
male and female, a greater proportion of the whole length of the chela is taken up
by the blade of the scissors than in forms taken from shallow water.


TABLE VII.

Carapace Length. ♂.

                        Mean           Probable error of mean    Standard Deviation
Deep water forms         8.5854 mm.         .0512 mm.               1.6696 mm.
Shallow water forms      8.4063 mm.         .0425 mm.               1.4918 mm.
Difference                .1791 mm.    Probable error of difference = .0665 mm.

The mean carapace length in the deep water forms is greater than that of the
shallow water forms by an amount more than twice as great as its probable error.
TABLE VIII.

Carapace Length. ♀.

                        Mean           Probable error of mean    Standard Deviation
Deep water forms         7.5405 mm.         .0293 mm.                .9417 mm.
Shallow water forms      7.1222 mm.         .0247 mm.                .8631 mm.
Difference                .4183 mm.    Probable error of difference = .0383 mm.

The mean carapace length in the deep water forms is greater than that of the
shallow water forms by an amount about ten times as great as its probable error.

Thus in both male and female the mean carapace length is greater in the deep
water forms, in the female by a certainly significant amount, in the male by a
quantity which taken alone could hardly be taken as significant, but viewed
in the light of the result obtained by the female it may possibly be considered
to be so.


III.

On the Comparative Variability of Deep and Shallow Water Forms.

TABLE VIII bis.

                          Deep Water                       Shallow Water                    Difference of
Character, Sex            S.D.              C. of V.       S.D.              C. of V.       S.D.'s            C.'s of V.
Measurement No. 1 ♂       2.7588 ± .0599    28.418 ± .665  2.5901 ± .0527    25.216 ± .544  .1687 ± .0798     3.202 ± .865
Measurement No. 1 ♀       1.0607 ± .0233    14.334 ± .322  1.0234 ± .0207    13.673 ± .278  .0373 ± .0312      .661 ± .425
Right Chela Length ♂      4.7279 ± .1026    26.313 ± .609  4.3769 ± .0890    23.434 ± .502  .3510 ± .1359     2.879 ± .789
Right Chela Length ♀      1.9679 ± .0433    13.916 ± .312  1.8215 ± .0369    13.036 ± .269  .1464 ± .0569      .880 ± .412
Carapace Length ♂         1.6696 ± .0362    19.446 ± .438  1.4918 ± .0303    17.746 ± .372  .1778 ± .0470     1.700 ± .575
Carapace Length ♀          .9417 ± .0207    12.489 ± .279   .8631 ± .0175    12.118 ± .249  .0786 ± .0271      .371 ± .374
Chela Index ♂             .01754 ± .00038   38.013 ± .936  .01642 ± .00033   36.404 ± .833  .00112 ± .00051   1.608 ± 1.253
Chela Index ♀             .01472 ± .00032   30.988 ± .744  .01343 ± .00027   28.493 ± .622  .00129 ± .00042   2.495 ± .970
The table above gives the standard deviations and the coefficients of variation*
with their probable errors, and we see:

(i) That in both forms the male, whether we judge by standard deviation or
coefficient of variation, is, for the characters considered, much more variable than
the female.

(ii) That there are significant differences in the variability of the deep and
shallow water forms. The deep water forms are in every single case the more
variable, however variability be estimated. For males the difference is always
greater and often much greater than its probable error; for females the difference
is less marked, but none the less quite obvious.

We can therefore conclude that the conditions of life are probably far more
stringent for the shallow than the deep sea forms, and for the females than for the
males. It would, perhaps, be rash to assert that the shore crabs are a selection
from the deep sea form, but the facts as to variability are not only compatible with
but indeed suggestive of such an hypothesis.
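The entries of Table VIII bis for Measurement No. 1 in the males can be recomputed from Table I. as a check. The coefficient of variation is 100 × standard deviation ÷ mean, as the footnote to this section defines it; the rule that the probable error of a standard deviation is that of the mean divided by √2 is an assumption taken from normal theory, adopted here because it reproduces the printed values. Function names are illustrative only.

```python
from math import sqrt


def coefficient_of_variation(sd, mean):
    """Percentage variation on the mean."""
    return 100.0 * sd / mean


def pe_of_sd(pe_of_mean):
    """Probable error of a standard deviation (normal-theory assumption)."""
    return pe_of_mean / sqrt(2.0)


# Measurement No. 1, males (means, S.D.'s and probable errors from Table I.).
cv_deep = coefficient_of_variation(2.7588, 9.7078)
cv_shallow = coefficient_of_variation(2.5901, 10.2718)
pe_sd_deep = pe_of_sd(0.0847)
pe_sd_shallow = pe_of_sd(0.0745)
print(round(cv_deep, 3), round(cv_shallow, 3))
print(round(pe_sd_deep, 4), round(pe_sd_shallow, 4))
```

The difference of the two coefficients, about 3.2, is the quantity compared with its probable error in the last column of the table.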
IV.

Comparison of the correlation between the length of carapace and each of the
other measurements for deep and shallow water forms.

To commence with the consideration of the males, Table IX. shows the correlation
between the length of the carapace and Measurement No. 1 for deep, Table X.
for shallow water forms.
For the former r = .9458 ± .0032,
For the latter r = .9337 ± .0037,
The difference = .0120 ± .0049.

* Percentage variation on the mean, i.e. 100 × standard deviation divided by the mean.
TABLE X.

Correlation between Length of Carapace and Measurement No. 1. ♂. Shallow Water Forms.


Variation in "Eupagurus Prideauxi"
Diagram I. shows the regression lines of Measurement No. 1 on the carapace
for these two depths; the dotted line is for the deep water forms, the continuous
line for the shallow water forms. It will be noticed that one carapace length is
associated with a much smaller value of Measurement No. 1 in the deep water
forms than in the shallow.


DIAGRAM I. Measurement No. 1.
Comparison between Regression Lines of Measurement No. 1 on Carapace Length between Shallow
Water ♂♂ (BB) and Deep Water ♂♂ (DD).

Shallow Water ♂♂.
Mean Measurement No. 1, 10.2718 mm.
Mean Carapace Length 8.4063 mm.
Regression of Measurement No. 1 on Carapace Length 1.6212.

Deep Water ♂♂.
Mean Measurement No. 1, 9.7078.
Mean Carapace Length 8.5854.
Regression of Measurement No. 1 on Carapace Length 1.5628.
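The regression coefficients given under Diagram I. can be recovered from the correlations and standard deviations already tabulated, on the assumption that the usual relation, slope = r × (S.D. of Measurement No. 1) ÷ (S.D. of carapace length), was used; the paper does not state the formula. The function names and the trial carapace length of 8.5 mm. are illustrative assumptions.

```python
def regression_coefficient(r, sd_y, sd_x):
    """Slope of the regression of y on x."""
    return r * sd_y / sd_x


def predict(mean_y, mean_x, slope, x):
    """Value of y on the regression line at abscissa x."""
    return mean_y + slope * (x - mean_x)


# Males: Measurement No. 1 (y) on carapace length (x).
slope_shallow = regression_coefficient(0.9337, 2.5901, 1.4918)
slope_deep = regression_coefficient(0.9458, 2.7588, 1.6696)

# Compare the two lines at one carapace length (8.5 mm., chosen arbitrarily).
at_shallow = predict(10.2718, 8.4063, slope_shallow, 8.5)
at_deep = predict(9.7078, 8.5854, slope_deep, 8.5)
print(round(slope_shallow, 4), round(slope_deep, 4))
print(round(at_shallow, 3), round(at_deep, 3))
```

At the trial carapace length the deep water line lies below the shallow water line, which is the relation the diagram is said to show.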


Table XI. shows the correlation between the length of the right chela and the
length of the carapace for deep, Table XII. for the shallow water ♂ forms.
For the deep water forms r = .9389 ± .0036,
For the shallow water forms r = .9503 ± .0028,
Difference = .0114 ± .0046.
TABLE XII.

Deep Water Forms. ♂. Correlation between Carapace Length and Chela Index.
[Axes: Carapace Length (millimetres); Chela Index.]
Diagram II. shows the regression lines of the chela length on the carapace
for deep and shallow water forms, the dotted line representing that of the
deep water forms as in Diagram I. The diagram shows that the same carapace
length is associated with a smaller chela length in the deep water forms than
in the shallow, but the difference between deep and shallow is not so marked
as in Diagram I.


DIAGRAM II. Length of Right Chela.
Comparison between Regression Lines of Right Chela Length on Carapace Length between Shallow
Water ♂♂ (BB) and Deep Water ♂♂ (DD).

Shallow Water ♂♂.
Mean Length of Right Chela 18.6773 mm.
Mean Length of Carapace 8.4063 mm.
Regression of Chela on Carapace 1.405.

Deep Water ♂♂.
Mean Length of Right Chela 17.9676.
Mean Length of Carapace 8.5854.
Regression of Chela on Carapace 1.329.


Tables XIII. and XIV. show the correlation between carapace length and chela
index for deep and shallow water groups respectively.

Deep water r = −.5224 ± .0223,
Shallow water r = −.4448 ± .0231,
Difference .0776 ± .0321.
Diagram III. shows the regression lines of the chela index on the carapace
length. The same carapace length is associated with a smaller chela index for the
shallow water than for the deep water forms.

DIAGRAM III. Regression Lines of Chela Index on Carapace Length in ♂♂ from Shallow
Water (BB) and from Deep Water (DD). [Diagram not preserved; axis label: Chela Index.]


With regard to the females, Tables XV. and XVI. correspond to Tables IX. 
and X. of the males and show the correlation between the carapace length and 
Measurement No. 1 for the deep and shallow water forms respectively. 

Deep water r = .8466 ± .0088,
Shallow water r = .8428 ± .0083,
Difference .0037 ± .0121.


The difference being less than its probable error is quite negligible. 
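
The probable errors attached to the differences in this section agree with the usual rule for two independent samples, namely the square root of the sum of the squared probable errors. The paper does not state the rule, so the sketch below rests on that assumption; the function name is illustrative.

```python
import math


def difference_with_probable_error(a, pe_a, b, pe_b):
    """Absolute difference of two independent estimates and its probable error.

    Assumes the samples are independent, so the probable errors combine
    as the root of the sum of squares.
    """
    diff = abs(a - b)
    pe_diff = math.sqrt(pe_a ** 2 + pe_b ** 2)
    return diff, pe_diff


# Female correlations of Measurement No. 1 with carapace length,
# deep water against shallow water, as printed above.
diff, pe = difference_with_probable_error(0.8466, 0.0088, 0.8428, 0.0083)
ratio = diff / pe

print(f"probable error of difference = {pe:.4f}")
print("negligible" if ratio < 1 else "exceeds its probable error")
```

A ratio below unity corresponds to the case described in the text, where the difference is less than its probable error.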
TABLE XV. Deep Water Forms. ♀. Correlation between Length of Carapace and
Measurement No. 1 (millimetres). [Table body not preserved.]

TABLE XVI. [Table body not preserved.]
Diagram IV. corresponds to Diagram I. It represents the regression lines of
Measurement No. 1 on carapace length, the dotted line being that of the deep
water females, the continuous line that of the shallow water females.

It will be seen that the relation these two lines bear to one another is very
much the same as that between the corresponding lines of the males. The diagram
shows therefore that in the females as well as the males the same carapace length
is associated with a smaller value of Measurement No. 1 in the deep water forms
than in the shallow water forms.

DIAGRAM IV.
Comparison between Regression Lines of Measurement No. 1 on Carapace between
Shallow Water ♀♀ (BB) and Deep Water ♀♀ (DD).

Shallow Water ♀♀.
Mean Carapace Length 7.1222 mm.
Mean Measurement No. 1 7.4850 mm.
Regression .999. r = .8428.

Deep Water ♀♀.
Mean Carapace Length 7.5405.
Mean Measurement No. 1 7.4000.
Regression .9535. r = .8466.


Tables XVII. and XVIII. show the correlation between the chela length and
the carapace length in the deep and shallow water females, corresponding to
Tables XI. and XII. of the males.

Deep water r = .8626 ± .0080,
Shallow water r = .8841 ± .0063,
Difference = .0215 ± .0106.
TABLE XVII. Deep Water Forms. ♀. Correlation between Length of Right Chela
(millimetres) and Length of Carapace. [Table body not preserved.]

TABLE XVIII. [Table body not preserved.]
Diagram V. shows the regression lines of the right chela length on the carapace
length for deep and shallow water females; it corresponds to Diagram II. The two
lines bear much the same relation to one another as in that diagram. And in the
same way as in Diagram II. the regression lines run closer together than in
Diagram I., thus those of Diagram V. run closer together than those in Diagram IV.,
showing that these points in relation to the chela and carapace for the deep and
shallow water forms which are illustrated by these diagrams are much the same
in the males as they are in the females. Namely that the same carapace length
is associated with a smaller value for Measurement No. 1 in the deep water forms
than in the shallow water forms, also with a smaller value of Measurement No. 2,
but that neither in the male nor in the female is the difference of the values of
Measurement No. 2 so great as of Measurement No. 1.

[Diagram V. not preserved; axis label: Length of Carapace.]

Comparison between Regression Lines of Right Chela Length on Carapace
Length of Shallow Water ♀♀ (BB) and Deep Water ♀♀ (DD).

Shallow Water ♀♀.
Mean Carapace Length 7.1222 mm.
Mean Chela Length 13.9735 mm.
Regression .9329. r = .8841.

Deep Water ♀♀.
Mean Carapace Length 7.5405 mm.
Mean Chela Length 14.1415 mm.
Regression .9013. r = .8626.


It was found that in the females the value of r for the correlation between 
chela index and carapace length was less than its probable error and the regression 
practically nothing and so neither the correlation tables nor the diagram of the 
regression lines are given here. 


V.
Comparison of Regression and Correlation Coefficients.

TABLE XIX.
Regression Coefficients of First Order.

                                                 Male                 Female
Regression Coefficients                   Deep Sea   Shallow    Deep Sea   Shallow

Measurement No. 1 on Carapace Length       1.5628    1.6212      .9535     .9994
Carapace Length on Measurement No. 1        .5723     .5378      .7516     .7108
Right Chela Length on Carapace Length      1.3293    1.4056      .9013     .9329
Carapace Length on Right Chela Length       .6633     .6477      .8255     .8379
Chela Index on Carapace Length             -.2744    -.2447        -         -
Carapace Length on Chela Index             -.9947    -.8084        -         -













Now at first sight it might appear that the greatest differences here between 
deep sea and shallow forms occur when we consider the regressions of the other 
organs on carapace length. But if we are dealing only with the question of
significant differences, we must note that the probable errors are much larger in these
cases. Thus the regression of right chela length on carapace length for deep sea
males must be read 1.3293 ± .0298, while for the inverse regression of carapace
length on right chela length we have .6633 ± .0037. Hence the change in the
latter regression from shallow to deep sea forms although so much smaller is 
really more significant. If, however, we suppose the changes between the regression 
coefficients for the two forms to be due, not to the random character of the samples, 
but to the direct action of natural selection, the greatest absolute differences 
are clearly in the regressions of the other organs on the carapace length, or the 
suggestion naturally follows that the differentiation between the deep sea and 
shallow forms is due more to a direct selection of the chela than of the carapace 
length. It would be interesting to determine whether this suggestion is or is not 
strengthened when the regression coefficients of higher orders are investigated. 


Exhibiting all the correlation coefficients in a single table we have 
Biometrika 11 27 
TABLE XX.
Correlation Coefficients.

                                                 Male                 Female
Characters                                Deep Sea   Shallow    Deep Sea     Shallow

Measurement No. 1 and Carapace Length       .9458     .9338      .8466       .8428
Right Chela Length and Carapace Length      .9389     .9503      .8626       .8841
Chela Index and Carapace Length            -.5224    -.4448    Insensible  Insensible








Or, we conclude that the male has the parts investigated far more highly corre- 
lated than the female. We have thus the males more variable and more highly 
correlated than the females. It is remarkable that in man the female is on the 
whole more variable, and more highly correlated than the male. There would 
appear therefore to be a relation between high variability and high correlation. 
Or, perhaps, it would be better to say that the more stringent the selection the 
smaller is the variability, and as a rule the correlation. Interpreted in this manner 
we might hazard the suggestion that the female crabs of both forms are subjected 
to more stringent selection than the males. 
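
The entries of Tables XIX. and XX. can be checked against one another, because for a pair of characters the product of the two regression coefficients equals the square of the correlation coefficient. The sketch below applies that standard identity to the printed figures; the row labels and the function name are illustrative and do not come from the paper.

```python
import math

# (regression of y on x, regression of x on y, printed r),
# transcribed from Tables XIX. and XX.
ROWS = {
    "Meas. No. 1 / carapace, deep males": (1.5628, 0.5723, 0.9458),
    "Meas. No. 1 / carapace, shallow males": (1.6212, 0.5378, 0.9338),
    "Meas. No. 1 / carapace, deep females": (0.9535, 0.7516, 0.8466),
    "Meas. No. 1 / carapace, shallow females": (0.9994, 0.7108, 0.8428),
    "Right chela / carapace, deep males": (1.3293, 0.6633, 0.9389),
    "Right chela / carapace, shallow males": (1.4056, 0.6477, 0.9503),
    "Right chela / carapace, deep females": (0.9013, 0.8255, 0.8626),
    "Right chela / carapace, shallow females": (0.9329, 0.8379, 0.8841),
    "Chela index / carapace, deep males": (-0.2744, -0.9947, -0.5224),
    "Chela index / carapace, shallow males": (-0.2447, -0.8084, -0.4448),
}


def implied_r(b_yx, b_xy):
    """Correlation implied by the two regressions; it takes their common sign."""
    return math.copysign(math.sqrt(b_yx * b_xy), b_yx)


discrepancies = {}
for label, (b_yx, b_xy, r) in ROWS.items():
    discrepancies[label] = abs(implied_r(b_yx, b_xy) - r)
    print(f"{label}: implied r = {implied_r(b_yx, b_xy):.4f}, printed r = {r:.4f}")
```

A row whose implied and printed values part company beyond the last decimal or two would point to rounding or a misprint in one of the three figures, not to anything biological.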


VI. 

General conclusions. (a) We have seen that with respect to the three 
characters measured certain differences do exist between those animals which have 
been living in deep water and those which have been living in shallow water. 
These differences may be congenital or they may be produced afresh in any gene- 
ration. The latter alternative is more likely than the former as the larvae are 
pelagic and those produced by deep water parents must get mixed up pretty 
effectually with those that spring from shallow water. It is of course possible that 
the former find their way to the deep water in settling down to the bottom and 
the latter to the shallow water, but this is rather difficult to imagine; or that 
they sink down without regard to the depth of the water, but die off immediately 
if they are not in approximately the same depth in which their parents lived. It 
is a problem of the same kind as is connected with the existence of local races in 
many animals with pelagic larvae. 

In the case of the difference arising separately in each generation, it must be 
produced either by selection or by the direct action of the environment and we 
have no data for determining which of these is the case. 

(b) The male is certainly more variable and more highly correlated than the 
female in both deep sea and shallow forms. 


(c) In both sexes the deep sea forms are more variable than the shallow water 
forms. 


These facts are consonant with a greater selection of female than male and of 
shallow water than deep sea forms. 











THE LAW OF ANCESTRAL HEREDITY*. 
By KARL PEARSON, F.R.S. 


(1) To any one who has made a close study of heredity, whether it be in men, 
mice or plants, there are one or two conclusions which must be accepted at once. 
The first of these conclusions is: 


(a) That a knowledge of the characters of the parents does not accurately define 
the character in the offspring. 


Every one is familiar with the fact that given two pairs of parents possessing 
the like character in the same degree the offspring of one pair will differ from 
those of the other in respect of this character, and further that all the offspring of 
the same pair are not alike in the character; and this variation in the offspring 
of the same pair or of like pairs of parents may be very great indeed. 


This point although familiar is frequently neglected. It is so important that 
it seems worth while to illustrate it by material at my disposal. The pedigrees I 
give are only samples of many hundreds in my possession, but they are conclusive. 
I turn first to coat-colour in thoroughbred horses. 


(i) The mare Ally (bay) covered with Little John (bay) gave a roan colt. 
There are no black coats in the ancestry of either Ally or Little John back to 
their great-grandsires and great-grandams. Jest, Ally’s dam, has, however, the
Delpini grey colour, which accounts for the roan. Ally again covered, this time
by Interpreter (bay), gives a black filly. There is no black blood in Ally, but
Interpreter’s paternal grandsire was the well-known black horse, Socerer. Thus
two bay horses may give a roan, or two bays may give a black foal. I can cite
any number of instances, of course, in which they give bay, brown or chestnut
offspring.


* The following paper is put together not as a reply to Mr W. Bateson’s rhetorical attack on the Law 
of Ancestral Heredity (published in his recent book Mendel’s Principles of Heredity), but simply 
to indicate to those interested in the matter what are really the fundamental assumptions involved 
in the Law of Ancestral Heredity, and how far it enables us to describe actually observed experience in 
man, horse and dog, which I am unable under any hypothesis to bring under Mendel’s “Principles.” In 
the course of this year various investigations on heredity in both plant and animal life will I hope be 
published, and these will tend to throw further light on the laws of heredity. 
(ii) Socerer was a black horse, son of the black Trumpator, out of Young
Giantess, a bay. But the four grandparents of Socerer were two chestnuts, a 
brown and a bay, and of the eight great-grandparents seven were bays and of one, 
the Snap Mare, I have not been able to ascertain the colour. Where did Socerer 
and Trumpator get their blackness from? Snap was brown, but his maternal 
grandsire was the black horse Gypsy. In the same way Rinaldo black was born 
of the chestnut Whiskey Mare covered by the brown stallion Milo, but in the
fourth and fifth generation backwards comes in the black Gypsy blood through the 
brown stallion Snap. 


(iii) A bay filly is the result of crossing the chestnut stallion General 
Graham and the grey Beningborough Mare. The chestnut stallion Prophet 
and the grey mare Virago give a grey filly. The chestnut stallion Woodpecker 
and the grey Herod Mare give a chestnut filly Chestnut Skin. Clearly the coat- 
colour of the parents will not define that of the offspring. The bay filly, however, 
of the General Graham and Beningborough Mare cross is elucidated when we 
know that while it had one chestnut and one grey grandparent, it had two bays 
for grandparents. 


I could multiply these pedigrees indefinitely, but the above will be sufficient 
to demonstrate the point for horses, i.e. the coat-colour of a horse may be
unintelligible unless we examine the ancestry. We may easily find offspring of
all shades from grey to black, whose parents had the same colour, say both bays. 


I now pass to dogs. Let us first take Basset Hounds. Here the colours 
are lemon, white and black, and if they all occur the hound is termed tricolour. 


(i) In 1885 the tricolour bitch La Fanfare put to the tricolour dog 
Bourbon gave the tricolour pup Bluette—melanism appeared in the offspring. 
In the same year the tricolour bitch Queen Dido covered by the tricolour dog 
Bourbon gave the lemon and white pup Blonde, or melanism disappeared. In 
1887 the lemon and white bitch Jessie crossed with the lemon and white dog 
K. Bendigo gave the tricolour hound Bendigo II. Thus when melanism is present
in the parents it may be absent in the children, or when absent in the parents, it 
may reappear in the children. 


(ii) His Lordship was tricolour, but his parents Scipio and Fama were only 
lemon and white. His black becomes quite explicable when we note that of his 
four grandparents two were tricolour. 


(iii) Or, in the same litter, the tricolour bitch Iris by the tricolour dog
Count casts five hounds of which three are tricolour and two lemon and white
only. It is thus impossible to predict from the character of the parents whether
black will or will not appear in the offspring. But here again ancestry throws
light on the matter. Count’s parents Ugly and Rosalind II. were both lemon
and white and Count got his black from his paternal grandparents, who were both
tricolour. Of Iris’ parents one showed melanism and the other did not.
What we are dealing with here is not a slight variation in shade, but the
presence or total absence of black pigmentation. This point must be borne in
mind.


These cases might be enough, perhaps, for dogs, but I cite the following 
instances out of many for greyhounds *. 


I would first state that I have sought in vain for any single colour or com- 
bination of colours which breeds true to itself. There is as far as I have been 
able to ascertain absolutely nothing that can correspond to Mendel’s “ recessive” 
character. At first it seemed to me possible that black would be such. Bessie 
Hay, a black bitch covered by Grinlaw a black dog, had a litter of entirely black 
offspring in 1896, and repeated this performance with four instead of five whelps 
in 1897. But the black bitch Finest Fury covered by the black dog Petronius in 
a litter of eight had five black and three red offspring. Black Middleton and black 
Raven gave three black, one blue, a red and a fawn offspring, while the black 
bitch Mayfly crossed by the black Hermas produced four black and two fawn 
offspring. In fact almost every possible greyhound colour will be found in the 
offspring of two black parents,—black, blue, either of these and white, fawn, red, 
brindled! If we next take such a cross as black and red we find red Recoil 
covered by black Fortuna Favente giving six black offspring and we at once 
consider whether black may be “dominant.” But the red bitch Maid of all 
Work threw in a litter to the black dog Black Marauder six brindled, one black, 
one blue offspring and a white bitch with one eye and two ears black. Black 
Nebula crossed by red Herschel gave five fawn offspring, one black and one blue 
and white. Parents of the same colours will be found to give litters of red, 
and black and white offspring. A blue sire (Blue Peter VIII.) and a black dam
(Flotilla) will produce all the offspring red, while a black bitch covered by a 
black and white sire will give pure black dogs, pure white dogs, or mixtures 
of black, red or fawn with white. The black bitch Queen of the Colonies covered 
by the red fawn dog Dodger gave beside five black and three red offspring a 
white bitch. No single colour breeds true to itself, and while all these results are 
explicable on the basis of ancestry, the Mendelian can only pass them over 
by asserting that the greyhound is an incorrigible mongrel. 


We may next pass to eye-colour in man and I give the following instances. 
Father and mother both blue, two out of six children dark brown. Father and 
mother both blue, one out of six children dark brown (colour of father’s maternal 
grandfather). Father and mother both light grey, four children light grey, and 
five black or very dark brown (colour of maternal grandmother). Now turn
to the opposite end of the scale: Father and mother black or very dark brown, 


* Pedigrees taken from data collected by Mr Howard Collins. 

† I gave the substance of such pedigrees to Prof. Weldon, who made use of it in his article on
Mendel’s Law in Biometrika (Vol. 1. p. 242) in a passage criticised by Mr Bateson in his Defence of 
Mendel, p. 192. I first tested Mendel’s theory of dominance on eye-colour in man when I read 
Mendel’s paper in 1900 and then communicated the result to Professor Weldon. 
one of eight children light blue (colour of paternal grandfather). Father black
and mother dark brown, one child out of three hazel (the colour of paternal 
grandmother). Father and mother very dark brown or black, two children blue 
and one black. Father and mother very dark brown or black, two out of four 
children blue (colour of maternal grandmother). It is clear that neither end 
of the eye-colour scale gives a recessive character which breeds true to itself. 
Now note what we get if we attempt to select for several generations paying 
attention to ancestry. Father and mother blue, all four grandparents blue, and 
the five recorded great-grandparents blue, all four children blue. Parents and 
grandparents all blue, all eight children blue. Father and mother light grey, 
and the four grandparents all light grey or blue, all seven children grey. Father 
and mother grey and all the known ancestry of both grey or blue, all four 
children grey or blue. Father and mother both dark brown, all known ancestry 
brown for both, out of nine children seven dark brown, one brown and one light 
brown. Father and mother hazel, all ancestry to great-grandparents hazel with 
the exception of a blue-eyed grandfather, all five children hazel. 


Lastly let us look at crosses between blue and dark eyes. Father light blue 
and mother dark brown, three children blue, four children dark brown. Father 
dark brown and mother blue, two children blue and four brown-eyed. Father 
dark brown and mother blue, three brown and six blue-eyed children; here all 
mother’s known ancestry was blue. If blue were “dominant” such results are 
impossible unless we suppose every dark brown parent above recorded was a 
hybrid. If blue be “recessive” it ought to breed true, but we have seen that 
it does not. But as a matter of fact we can pick out cases from the record 
of one blue and one very dark parent giving almost every conceivable result, 
e.g. father light blue and mother black, four children blue and one light grey. 
Father brown and mother blue, eight children ranging from brown to dark brown. 
In all cases an examination of ancestry throws light, if it does not fully explain in 
each case what has taken place. It is clear from these cases that parentage 
will not enable us to predict offspring definitely. But what is also equally clear is 
that if all the ancestry be the same the offspring may differ among themselves 
considerably, e.g. by the presence or total absence of black pigmentation. Hence 
we reach our second conclusion : 

(b) That a knowledge of the whole ancestry while it certainly limits the range 
of variation does not absolutely define the character of the offspring. 

(2) Before we leave the cases above it is just worth reiterating that nothing 
corresponding to Mendel’s principles appears in these characters for horses, dogs, 
or men. If black or grey coat-colour in horses were “ recessive,” when two blacks 
were mated we should expect only black offspring, but black can disappear for a 
generation or even two and then reappear. Or, take a case like that of a grey 
horse Viscount, where grey remained dominant for three generations only to 
disappear before the chestnut of the mare Blue Stocking in the Viscount and Blue
Stocking filly Miss Johanna! It is the same with every coat-colour taken, its
relative constancy depends largely on the extent to which it has appeared in the
ancestry, and one by one black, bay, chestnut, grey must be dismissed by the
Mendelian as neither “recessive” nor “dominant,” but as marking “permanent
and incorrigible mongrels.” The same remark applies to coat-colour in hounds,
black must be the mark of incorrigible mongrelism for it is neither recessive 
nor dominant. It is the same with red and white, no colour in the parents gives 
for every case consistent offspring. 


Nor again do we find that two black-eyed human beings nor two blue-eyed 
human beings mated together give any sign of the dominance of one or other 
of these extremes; any more than do intermediate tints to which the description 
of “incorrigible mongrelism” may by some be applied. It will be clear that
for man (and this applies not only to eye-colour, but to hair-colour, and to a
whole series of measurable characters * of which we have in each case more than
a thousand instances in our Family Records), for horses in coat-colour, and for
dogs in coat-colour † nothing approaching Mendelian principles holds. We have
therefore to classify large portions of the animal kingdom as exceptions to Mendel’s 
Laws,—and these are cases where the evidence is not based on five or ten in- 
dividual crosses followed perhaps for two or at most three generations—but 
on 1000’s of crosses, and where the pedigree has been or can be investigated for 
some of the material for five to ten generations. What may happen in the 
case of plant hybrids, I am not able from personal observation to assert, but there 
is enough weight of evidence here to make one pause before one is prepared to 
admit that Mendel, or his followers, can change each conception of life in which 
heredity bears a part ! 


(3) Taking our stand then on the observed fact that a knowledge neither of 
parents nor of the whole ancestry will enable us to predict with certainty in a 
variety of important cases the character of the individual offspring we ask : What 
is the correct method of dealing with the problem of heredity in such cases? The 
causes A, B, C, D, E, ... which we have as yet succeeded in isolating and defining
are not always followed by the effect X, but by any one of the effects U, V, W,
X, Y. We are therefore not dealing with causation but correlation, and there is
therefore only one method of procedure possible; we must collect statistics of the
frequency with which U, V, W, X, Y, Z respectively follow on A, B, C, D, E....
From these statistics we know the most probable result of the causes A, B, C,
D, E and the frequency of each deviation from this most probable result. The
recognition that in the existing state of our knowledge the true method of
approaching the problem of heredity is from the statistical side, and that the
most that we can hope at present to do is to give the probable character of 
the offspring of a given ancestry, is one of the great services of Francis Galton to 
biometry. 
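
The procedure just described, that of counting how often each effect follows a given set of causes and reading off the most frequent, can be sketched in a few lines. The records below are hypothetical and serve only to show the form of the tabulation; they are not data from the Family Records, and the function names are illustrative.

```python
from collections import Counter, defaultdict

# Hypothetical records: (eye colours of father and mother, eye colour of child).
# Invented for illustration only; not observations from the paper.
RECORDS = [
    (("blue", "blue"), "blue"),
    (("blue", "blue"), "blue"),
    (("blue", "blue"), "dark brown"),
    (("blue", "blue"), "blue"),
    (("blue", "blue"), "dark brown"),
    (("blue", "blue"), "blue"),
    (("dark brown", "blue"), "dark brown"),
    (("dark brown", "blue"), "blue"),
    (("dark brown", "blue"), "dark brown"),
]


def tabulate(records):
    """Frequency of each observed effect for each set of causes."""
    table = defaultdict(Counter)
    for causes, effect in records:
        table[causes][effect] += 1
    return table


def most_probable(table, causes):
    """Most frequent effect for the given causes and its relative frequency."""
    effect, count = table[causes].most_common(1)[0]
    return effect, count / sum(table[causes].values())


table = tabulate(RECORDS)
effect, frequency = most_probable(table, ("blue", "blue"))
print(effect, round(frequency, 3))
```

The remaining entries of each row give the frequency of every deviation from the most probable result, which is all that the statistical method claims to supply.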


* Mendelianism fails also for skin-colour in crosses between the black and white races of man. 
† Other characters in pedigree stock are being taken into consideration at present.
Yet since the publication of Francis Galton’s Natural Inheritance in 1889 much
progress has been made in the theory of multiple correlation, and also in our 
treatment of characters not quantitatively measurable, but which we may reason- 
ably suppose to be based on quantitative factors,—as the degree of yellow pigment 
in the human eye. Further larger series of observations have been collected 
and reduced, and we are much clearer than we were ten years ago on the nature 
of regression and ‘blend.’ 


Given two relatives P and Q the general theory of statistics tells us how 
to dress a table from which we can read off at once the most probable character 
of P when we know the character of Q. There is no biological theory in this, 
it is merely an arrangement of statistical information into classes. Such classes 
may be arranged on an actual quantitative scale, or as in the case of eye or hair 
colour on a qualitative scale, which with a high degree of probability corresponds 
to a true quantitative scale in some one or other pigment, whose amount cannot 
be actually determined. In both cases the methods of statistics allow us to 
determine : 

(a) The variability in character of both P and Q in terms of the range 
of one of these arbitrary classes. 

(b) The average variability in the same terms of an array of P’s corre- 
sponding to a given Q, or of an array of Q's corresponding to a given P. 

(c) The first two terms of the closest linear function which expresses 
the probable character of P or Q in terms of the known character of Q or P. 


[Diagram: Inheritance of Cubit. Horizontal axis: Cubit of Father in Inches.]

Probable value of cubit of son Q, for a known cubit in father P. A marked deviation from linearity
only occurs in the extreme case of father’s cubit, where there were only a few instances to base the
son’s mean upon.
All this has nothing in it peculiar to heredity, it is simply an application
of the higher theory of statistics. 

If we ask how closely the above linear function gives the probable Q-value 
from the known P-value; the answer is: Admirably, for those characters which 
have been actually tested. I have 21 tables involving three characters for diverse 
relationships in man and each embracing upwards of 1000 pairs. I choose one, 
the cubit, perfectly at random and it will be seen at once that, within the limits 
of random sampling, no function could give the probable character of Q for a given 
value of P with greater efficiency. 

The line here dealt with is the so-called regression line; it is a purely
statistical result, and has no relation to any biological theory or hypothesis as
to heredity. Its determination depends on the so-called coefficient of correlation
of P and Q, which we will write $r_{pq}$, and their variabilities $\sigma_p$ and $\sigma_q$. In addition
we must know the mean, the modal or, what is often sufficient, the median values
of P and Q.
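
To show how these constants determine the line, the following sketch builds the regression of Q on P from the correlation, the two variabilities and the two means, and also returns the average variability of an array of Q's for a given P. It uses the standard formulae of linear regression; the numerical constants are hypothetical and are not the cubit values of the paper.

```python
import math


def regression_of_q_on_p(r_pq, sigma_p, sigma_q, mean_p, mean_q):
    """Return (predictor, slope, array_sd) for the regression of Q on P.

    slope    = r_pq * sigma_q / sigma_p
    array_sd = sigma_q * sqrt(1 - r_pq**2), the variability of Q about the line
    """
    slope = r_pq * sigma_q / sigma_p

    def probable_q(p):
        # The line passes through the two means with the slope found above.
        return mean_q + slope * (p - mean_p)

    array_sd = sigma_q * math.sqrt(1.0 - r_pq ** 2)
    return probable_q, slope, array_sd


# Hypothetical constants (inches), chosen only to illustrate the arithmetic.
probable_q, slope, array_sd = regression_of_q_on_p(
    r_pq=0.5, sigma_p=1.0, sigma_q=1.0, mean_p=18.0, mean_q=18.5
)

print(slope, probable_q(20.0), round(array_sd, 4))
```

With a slope below unity a father two inches above his mean gives a probable son only one inch above the sons' mean in this illustration, and the sons of such fathers still vary about that value.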


Now suppose that instead of one relative P we have any number $P_1, P_2, P_3, \dots$,
and let the corresponding statistical constants be $r_{p_1 q}, r_{p_2 q}, r_{p_3 q}, \dots, r_{p_1 p_2}, r_{p_1 p_3}, r_{p_2 p_3}, \dots$
Then the theory of statistics shows us that, if $p_q$ be the most probable deviation of
Q from the type of its generation, and $h_{p_1}, h_{p_2}, h_{p_3}, \dots$ be the observed deviations of
$P_1, P_2, P_3, \dots$ from the types of their respective generations, then :

$$p_q = J_1 \frac{\sigma_q}{\sigma_{p_1}} h_{p_1} + J_2 \frac{\sigma_q}{\sigma_{p_2}} h_{p_2} + J_3 \frac{\sigma_q}{\sigma_{p_3}} h_{p_3} + \dots \qquad \text{(i)},$$

where $J_1, J_2, J_3, \dots$ are known expressions involving only the $r$’s, and can be
calculated as soon as the latter have been found from observation.

Further the variability of Q about its most probable value, when we know the
P’s, takes on the average the value $\sigma_q \times A$, where A is another well-known and
calculable function of the $r$’s.
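
The text does not write out the J's or A. In the ordinary theory of multiple correlation the coefficients of such an expression are the solution of the linear equations formed from the correlations of the relatives among themselves and with Q, and the residual factor is the square root of one minus the part of the variance accounted for. The sketch below assumes that identification; the correlation values are hypothetical, and a small elimination routine is included so that nothing beyond the standard library is needed.

```python
import math


def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def prediction_constants(r_pp, r_pq):
    """Return (J, A) under the assumption stated above.

    r_pp: correlations of the relatives among themselves (unit diagonal).
    r_pq: correlations of each relative with Q.
    """
    j = solve(r_pp, r_pq)
    explained = sum(ji * ri for ji, ri in zip(j, r_pq))
    return j, math.sqrt(1.0 - explained)


# Hypothetical case: each parent correlated 0.5 with the offspring,
# and the two parents uncorrelated with one another.
j_one, a_one = prediction_constants([[1.0]], [0.5])
j_two, a_two = prediction_constants([[1.0, 0.0], [0.0, 1.0]], [0.5, 0.5])

print(j_one, round(a_one, 4))
print(j_two, round(a_two, 4))
```

In this illustration the J's are fractions less than unity, and A is smaller for two relatives than for one; both features depend on the assumed correlations being positive.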

The above is in no sense a biological theory, it is based on no data whatever 
except the actual statistics; it is merely a convenient statistical method of 
expressing the observed facts. If the facts are there it expresses them up to 
a certain point,—the most probable or most frequent value of the individual 
given his relatives. It cannot possibly within this range be upset by any 
hypothesis of heredity, Mendelian or otherwise, for it is based on no biological 
assumptions whatever. It is merely a convenient description of statistical tabula- 
tions. 


I now proceed to limitations based on actual experience. 
(i) The correlation coefficients between relatives are positive. 
There is overwhelming evidence in favour of this in insects, animals and plants, 
for direct relationship back to great-great-grandparents and in a considerable 
range of collateral relationships—fraternal and avuncular. 
But it may be shewn that if the r’s are positive, increasing the number
of relatives used as a basis of prediction reduces the value of A. The rate 
of reduction decreases considerably as we increase the number of correlations, but 
for the first increase from one on to two, three ... six, etc. it is very sensible. 


We therefore conclude that : 


There will be less average deviation from the probable character of an individual 
when we increase the number of relatives on which the prediction is based. 


As a special case we say that : 


Prediction will be closer when we use the ancestry as well as the parents in 
forming it, than when we use the parents alone. 


As soon as we accept the positive character of the correlation coefficients 
between offspring and ancestry this is an absolutely certain result. It could only 
be invalidated if the parents defined the offspring absolutely, i.e. if A for two 
parents was zero, or the correlation with parentage perfect. But this is com- 
pletely negatived by the fact * that the same pair of parents will produce offspring 
with very different characters,—e.g. two bay horses may give a black horse 
and not only a bay. Thus it seems to me that any hypothesis of inheritance 
which neglects ancestry is foredoomed to failure in the wide range of vital types 
for which we have shewn already that parentage does not define offspring 
uniquely. The variation in offspring of the same parents is not merely confined 
to slight differences in size of organs, but in many cases covers differences in colour, for
example, as great as those upon which the Mendelians have based their dis- 
crimination between dominant and recessive characters. 


(ii) In actual measurement on many series the J’s come out less than unity 
and there are theoretical reasons for holding that this must always be the case. 


Hence if the variability of the offspring generation be not much larger than 
that of the generation of any relative, the probable deviation from type of any 
individual will only receive from any relative a fraction of his observed deviation 
from his type. 


This is Mr Galton’s principle of regression: the exceptionality of P is on the 
average only exhibited in part by his relative Q. This purely statistical and 
legitimate conclusion was seized upon as a biological law, and all life, but for 
constant selection, was assumed to be in a state of regression to some distant 
ancestry. The expression (i) does not warrant this assertion at all. To begin 
with there is no reason to suppose: 


(a) That $\sigma_q = \sigma_{p_1} = \sigma_{p_2} = \sigma_{p_3} = \ldots$, or that the variability of all generations 
is the same. In numerous and large series I have not found it so†. 


* For all the characters yet dealt with whether quantitative or qualitative in insects, animals and 
plants, there is no approach to A=0. 


† The offspring generation tends almost universally to be more variable than that of any ancestral 
generation; parents are a selected portion of the community. 


(b) $h_{p_1}, h_{p_2}, h_{p_3}, \ldots$ are deviations from the type of each generation. I have 
never found the types of each generation identical. They often differ very 
sensibly. 


(c) Each individual ancestor contributes only a fraction of his deviation 
from type to the probable deviation in type of the offspring. But if the ancestry 
be maintained for two or three generations at a given deviation from type 
the contributions of the different terms of (i) provide a probable deviation in type 
of offspring which may be equal to or may even exceed that of the ancestry. 


This point is over and over again forgotten when biologists talk of regression 
as if it were a persistent retrogressive factor. 

As a matter of fact with the numbers we already have for man, breeding true 
for three or four generations gives a value of $p_q$ within a small percentage of the 
selected $h$, and this is only slowly modified, if the stock continues to breed true 
to itself. There is no such thing in statistical theory as a necessary regression if 
the selected stock pair with selected stock. The rapid establishment of breeds is 
not evidence therefore against the present view of heredity. On the contrary 
it flows at once from it. 


(4) So far there is practically no assumption in our treatment of heredity, 
which has not been justified by ample experience, e.g. the close linearity in 
distribution of the probable value of a character in one relative for a known 
value of a character in a second. Further it enables us to predict probable values 
from any group of known relatives. When however we predict from direct 
ancestors only we state more particularly the law of ancestral heredity. If we are 
content with parents, or possibly in some cases with grandparents also, we have 
material for making a fairly close prediction, but if we want to deal with whole 
lines of ancestry, we are met at once by the difficulty of collecting statistical 
material for the correlation of the offspring’s character with that of the higher ante- 
cedents. In man few observations or measurements have been made on a higher 
than the grandparental generation, and even in pedigree animals, where we can go 
much further back, the characters recorded are never quantitative, but concern 
colour or markings. To get over this absence of material Mr Galton originally 
proposed that we should correlate not with each individual ancestor but with 
the mean of each ancestral generation, females being reduced to a male standard. 
There is no assumption in this because, of course, correlation can statistically 
be worked for any such group of ancestry. But as we do not know the values of 
the correlation coefficients of the higher groups, Mr Galton suggested that we 
should take the $J_1, J_2, J_3, \ldots$ of equation (i) on p. 217 equal to ½, ¼, ⅛, ... respectively. 
This was undoubtedly an assumption, although not an unreasonable one 
a priori. That the intensity of ancestral heredity diminishes as we go backwards 
is demonstrated by both experiment and observation; and a geometrical series 
naturally first arises as a measure of such diminishing influence. But the law 
could only be demonstrated on the base of the first few terms of the J-series, and 




Mr Galton’s evidence from stature only gave a rough test of the series ½, ¼, ⅛, ... 
having the right trend. He did not himself claim more for it than this; thus 
after suggesting that $J_1$ = ½ and $J_2$ = ¼, he writes: “It would, however, be 
hazardous to extend this sequence with confidence to more distant generations*.” 

Referring again to this point, Mr Galton says in 1897†: “I stated it briefly and 
with hesitation in my book on Natural Inheritance.” 

When I personally first came to investigate the matter from the more purely 
mathematical side, it seemed to me better to start with the coefficients of corre- 
lation and make no assumption as to the values of the regression coefficients 
in the J-series of p. 217‡. Each coefficient found would serve as the basis of a 
limited amount of prediction, and we must plod away finding coefficients for as 
many generations for as many characters in as many races as we possibly could. 
When these were known, we should possibly be able to predict the relationship 
between the correlation coefficients of successive generations even beyond the 
limits of observation and experiment. If we had to make an hypothesis at all, it 
seemed to me most reasonable to suppose the correlation not the regression 
coefficients of each generation of ancestry to diminish in a geometrical series. 
This view I kept steadily before me, but it was very difficult to find material 
going sufficiently far back to test it. Theoretically I shewed that if these corre- 
lations formed a geometrical series then the regression coefficients, or J’s, would 
also form a geometrical series, if we neglected the effects of assortative mating§, 
but the data I had collected did not as far as they went justify Mr Galton’s 
proposed ½, ¼, ⅛, ... series. 


Taking the matter broadly we may say that the Law of Ancestral Heredity 
implied two ideas: 

(a) That the proper method to proceed in heredity within the race is by 
the statistical theory of multiple correlation—this does not exclude the truth of 
any physiological theory, although it may serve to confirm or refute such a theory. 
The correlation between parent and offspring in man will remain about .45 to .5, 
whatever theory of gametes may eventually be accepted, and the prediction of 
probable character in son from actual character in father remains equally valid. 

(b) That a knowledge of the nearer coefficients of correlation, i.e. those 
between offspring, and parents, grandparents, and possibly great-grandparents, 
will suggest the more distant ones, and that probably these, and consequently the 
multiple regression coefficients, are expressible as a geometrically decreasing 
series. 

This is the hypothesis involved in my own expression of the Law of Ancestral 
Heredity, and I believe it is a generalisation, which Mr Galton might, perhaps, 
be willing to accept in place of his much more definite series of ½, ¼, ⅛, etc. 


* Natural Inheritance, p. 136. 

† R. S. Proc. Vol. LXI. p. 401. I have elsewhere shewn that the evidence from Basset Hounds admits 
of other interpretations. R. S. Proc. Vol. 66, p. 140 et seq. 

‡ “Regression, Panmixia and Heredity.” Phil. Trans. Vol. 187, p. 253 et seq. 

§ “On the Law of Ancestral Heredity.” R. S. Proc. Vol. 62, p. 394. 


(5) Assuming we had the data for quantitatively measurable characters one 
would prefer to deal with such, but failing this I have asked myself how far the 
successive correlation coefficients for ancestry in qualitative characters (which we 
may reasonably suppose to have a quantitative scale behind them) fall into a 
geometrical progression. In eye-colour for man I have previously given* the four 
parental and the eight grandparental correlations. Mr F. Lutz has completed† 
the series, as far as it is possible to do on Mr Galton’s material, by investigating 
the correlation of offspring with the great-grandparents, arranged in four groups. 
Mr L. Bramley-Moore and I had given the four parental and two of the grandparental 
coat-colour correlations for horses in 1899‡. Mr N. Blanchard published 
two further grandparental correlations recently§, and he has now completed the 
whole series of eight, and they will be found in a note attached to this paper. 
Dr Alice Lee worked out the great-grandparental and the great-great-grandparental 
correlations from the data provided by Mr Blanchard’s MS. pedigree 
books; these are given in a second note below. 


We have reached the following results: 


Mean Correlation              Eye-colour in Man    Coat-colour in Horse 
Parental                            .4947                 .5216 
Grandparental                       .3166                 .2976 
Great-grandparental                 .1879                 .1922 
Great-great-grandparental             -                   .1469 


When we note that the probable error of these coefficients is about the order 
.03, one is at once struck with their substantial agreement. We are forced to the 
conclusions: 


(i) That eye-colour in man and coat-colour in horses are inherited in 
sensibly the same manner. 


(ii) That the original series proposed by Mr Galton 4, 4... for these 
correlations|| will not fit them at all. 


* Phil. Trans. A, Vol. 195, p. 106. 

† See Note on the Influence of Change in Sex on Inheritance. Biometrika, Vol. 1. p. 237. 

‡ Phil. Trans. A, Vol. 195, p. 93. 

§ Biometrika, Vol. 1. p. 361. 

|| Natural Inheritance, p. 133. 


It might be supposed that pigmentation followed some other rule than 
measurable characters. But in recent work on measurable characters for 12 
series, each numbering upwards of 1000 cases, I have found for man the mean 
parental value .46. This is the largest inheritance series for man yet worked 
out, being based on measurements of upwards of 1100 families, so I am pretty 
confident that parental inheritance for man lies between .45 and .5, and is quite 
incompatible with a value of 4*. 


I now turn to the problem of how far the data can be represented by a 
geometrical series. I fitted the best geometrical series first to the horse only, 
then to man, and finally to both with the following results. 


A.   Horse, observed    Best Geometrical Series    Man, observed 
          .52                    .48                    .49 
          .30                    .32                    .32 
          .19                    .21                    .19 
          .15                    .14                     - 

B.   Man, observed      Best Geometrical Series    Horse, observed 
          .49                    .50                    .52 
          .32                    .31                    .30 
          .19                    .19                    .19 
           -                     .12                    .15 


In the first case, the series being a, ar, ar², ... we have: 


a = .4809 and r = .6602. 

In the second case: 

a = .5007 and r = .6167. 


If we take the best geometrical series embracing the data for both races we 
have: 

C.   Horse, observed    Man, observed    Best Geometrical Series 
          .52                .49                  .49 
          .30                .32                  .32 
          .19                .19                  .20 
          .15                 -                   .13 


Here we have: 
a = .4921 and r = .6428. 


Lastly, if we take a = .5, r = ⅔ as close round numbers, we have: 


D.   Horse, observed    Man, observed    Close Series 
          .52                .49              .50 
          .30                .32              .33 
          .19                .19              .22 
          .15                 -               .15 


Within the errors of observation this close series represents excellently the 
observed results for either eye-colour in man or coat-colour in the horse. In other 
words: 

As far as the available data at present go inheritance coefficients for ascending 
ancestry are within the limits of observational error represented by a geometrical 
series and by the same series. 


* As long as the only data for a measurable character in man (Mr Galton’s stature data) gave a 
parental correlation of about 4 it seemed needful to emphasise the distinction between the results for 
eye-colour and stature. In the light of my present knowledge, the distinction between the two classes of 
characters seems not so clear or needful. A large series of pedigree dogs at present in hand gives a 
parental correlation almost identical with that of horse and man. 


From this it follows that*: The contributions of the ancestry also follow a 
geometrical series, although not that originally proposed by Mr Galton. 
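
The fitted columns of Tables A to D can be regenerated from the constants a and r quoted above. The sketch below only evaluates a, ar, ar², ar³ and rounds to two decimals; it does not attempt to reproduce the fitting itself, since the method used to find the "best" series is not stated here. The function name and dictionary labels are illustrative.

```python
def geometric_terms(a, r, n=4):
    """First n terms a, a*r, a*r**2, ... rounded to two decimals."""
    return [round(a * r ** k, 2) for k in range(n)]

# Constants as quoted in the text for each fit.
fits = {
    "horse": (0.4809, 0.6602),
    "man": (0.5007, 0.6167),
    "both": (0.4921, 0.6428),
    "close": (0.5, 2.0 / 3.0),
}

for name, (a, r) in fits.items():
    print(name, geometric_terms(a, r))
```

Each printed row can be set against the corresponding "Best Geometrical Series" or "Close Series" column and against the observed parental, grandparental, great-grandparental and great-great-grandparental values.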


(6) Mr Galton has assumed, that if the relatives include all the ancestry, and 
if all these ancestry had the same deviation h, the offspring will have a probable 
deviation of h. This is really the introduction of a biological hypothesis, the 
truth of which can only be tested by observation. Mr Galton deduces this result 
in the following manner: he supposes a stable population, i.e. one in which the 
mean and variability of each generation remain the same, and the parents in each 
generation are the whole or at any rate a random sample of that generation; 
there must also be no reproductive selection, or fertility must not be correlated 
with the character of which the inheritance is under consideration. Further 
there must be no assortative mating. Under these circumstances we have the 
following form of (i) p. 217: 

$$p_q = (J_1 + J_2 + J_3 + \ldots)\,h,$$

and since Mr Galton holds that $p_q$ will then equal h, we have 

$$J_1 + J_2 + J_3 + \ldots = 1 \qquad (2).$$

Any geometrical series $\epsilon(1 + \rho + \rho^2 + \ldots)$ for the J’s will satisfy this condition 
if $\epsilon = 1 - \rho$; the series ½ + ¼ + ⅛ + 1/16 + ... is not the only one satisfying (2)†. 
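
That condition (2) picks out a whole family of geometrical series, and not Mr Galton's alone, can be checked directly. A minimal sketch, writing the series as ε(1 + ρ + ρ² + ...) with sum ε/(1 − ρ); the second pair of constants (.75, .25) is a hypothetical example chosen only because it also satisfies ε = 1 − ρ.

```python
def series_sum(eps, rho):
    """Sum to infinity of eps * (1 + rho + rho**2 + ...), for |rho| < 1."""
    return eps / (1.0 - rho)

def partial_sum(eps, rho, n):
    """Sum of the first n terms of the same series."""
    return sum(eps * rho ** k for k in range(n))

# Galton's series 1/2, 1/4, 1/8, ... corresponds to eps = rho = 1/2.
galton_total = series_sum(0.5, 0.5)
# A hypothetical second series with eps = 1 - rho.
other_total = series_sum(0.75, 0.25)
# Ten generations of Galton's series fall just short of unity.
galton_ten = partial_sum(0.5, 0.5, 10)
print(galton_total, other_total, galton_ten)
```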


But it is doubtful‡ how far this conclusion is justifiable. Statistically it is of 
course unnecessary. (i) is a relation between the probable deviation of Q from 
its mean and the actual deviations of each P from their individual means. 
There is no reason why the means of Q and of all the P’s should be the same. 
They may be different owing to environment or to selection. Further there is 
no reason why the variabilities should be the same; parents may be a selection 
out of the general community in each generation. As soon as we realise that the 
h’s are deviations from the generation means, and these are not all the same, and 
that the variability in each generation differs, the need for the relation 

$$J_1 + J_2 + J_3 + \ldots = 1$$


ceases to be apparent. 


* R. S. Proc. Vol. 62, p. 394. 

† R. S. Proc. Vol. 62, p. 402, and Vol. 66, p. 147. 

‡ It is hardly consonant, for example, with what we know of eye-colour, where there is an assortative 
mating coefficient of .1002: see Phil. Trans. A, Vol. 195, p. 113. 


In data like eye-colour in man and coat-colour in thoroughbred horses there 
has been undoubtedly a secular change going on; the proportion of blue eyes in 
the latest offspring is considerably less than in the great-great-grandparents, 
while the early grey horses have largely disappeared from the stud-book. 
Further we find each ancestral generation is roughly speaking less variable than 
the one below it. Men who live to be parents are a more select group than their 
offspring and this is still more true of grandparents and higher ancestry. To be 
a great-great-grandparent means to have a healthy stock, and such men form a 
selection only of the community in each generation. Hence neither for man 
nor the thoroughbred horse need we a priori expect to find, for the correlation 
coefficients as actually determined, the relation $J_1 + J_2 + J_3 + \ldots = 1$ or $\epsilon = 1 - \rho$ 
satisfied. The actual statistics for man and the horse conclusively prove that it is 
not satisfied. If we had a population in which all members were randomly mated 
and were parents of equal fertility; further if every generation had the same mean 
and variability, then we might have $\epsilon = 1 - \rho$, but then the correlations of ancestry 
and offspring would probably take very different values from what we find them 
to have in the cases of man and horse under existing circumstances. 


(7) Supposing all mating to be random* we can find $\epsilon$ and $\rho$ and all the J’s 
by the method of my first paper on the “Law of Ancestral Heredity†.” In the 
notation of the present paper, if $ar^n$ be the correlation coefficient of “the (n+1)th 
mid-parent” with the offspring, and if $J_n = \epsilon\rho^{n-1}$, we find: 

$$\epsilon = r - \rho \qquad (3),$$

$$\epsilon - a + \frac{\epsilon\rho a}{1 - \rho r} = 0 \qquad (4);$$

whence: 

$$\rho^2 - \rho\,\frac{1 + r^2 - 2ar}{r - a} + 1 = 0 \qquad (5).$$

Hence taking a = .5 and r = ⅔ as close enough to the observed values we have: 

$\rho$ = .2251, $\epsilon$ = .4415. 


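Formula (i) of p. 217 using “mid-parents” now becomes: 

$$p_q = .4415\left\{\frac{\sigma_q}{\Sigma_1}h_{p_1} + .2251\,\frac{\sigma_q}{\Sigma_2}h_{p_2} + (.2251)^2\,\frac{\sigma_q}{\Sigma_3}h_{p_3} + \ldots\right\}.$$

Here $\Sigma_1, \Sigma_2, \Sigma_3, \ldots$ are the standard deviations of the mid-parental groups, and 
$\Sigma_s = \sigma_s/(\sqrt{2})^s$, if $\sigma_s$ be the standard deviation of the sth generation of ancestors. 
Thus: 

$$p_q = .4415 \times \sqrt{2}\left\{\frac{\sigma_q}{\sigma_1}h_{p_1} + .2251\sqrt{2}\,\frac{\sigma_q}{\sigma_2}h_{p_2} + (.2251\sqrt{2})^2\,\frac{\sigma_q}{\sigma_3}h_{p_3} + \ldots\right\}$$

$$= .6244\left\{\frac{\sigma_q}{\sigma_1}h_{p_1} + .3184\,\frac{\sigma_q}{\sigma_2}h_{p_2} + (.3184)^2\,\frac{\sigma_q}{\sigma_3}h_{p_3} + \ldots\right\}$$

$$= \epsilon\left\{\frac{\sigma_q}{\sigma_1}h_{p_1} + \rho\,\frac{\sigma_q}{\sigma_2}h_{p_2} + \rho^2\,\frac{\sigma_q}{\sigma_3}h_{p_3} + \ldots\right\}.$$
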
Clearly $\epsilon$ = .6244 is not equal to $1 - \rho$ or .6816, or Mr Galton’s hypothesis is 
certainly not satisfied either for man or horse. But since we know in these cases 
that $\sigma_1 = \sigma_2 = \sigma_3 = \ldots = \sigma_q$ is not true, and further that $h_{p_1}, h_{p_2}, h_{p_3}, \ldots$ are measured 
from different means, we have no a priori right to suppose it ought to be satisfied. 


* R. S. Proc. Vol. 66, p. 149. 
† R. S. Proc. Vol. 62, p. 394. 
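
The constants of this section can be checked by a short computation. The sketch below assumes that equations (3) and (5) read ε = r − ρ and ρ² − ρ(1 + r² − 2ar)/(r − a) + 1 = 0, a reading adopted here because it reproduces the quoted values; the printed equations themselves are not legible in this copy. The smaller root of the quadratic is taken, and the mid-parental constants are then multiplied by √2 to pass to individual ancestors.

```python
import math

def solve_rho_eps(a, r):
    """Solve rho**2 - b*rho + 1 = 0 with b = (1 + r**2 - 2*a*r) / (r - a),
    taking the root below unity, then set eps = r - rho.
    The form of these equations is an assumed reading (see lead-in)."""
    b = (1.0 + r * r - 2.0 * a * r) / (r - a)
    rho = (b - math.sqrt(b * b - 4.0)) / 2.0
    eps = r - rho
    return rho, eps

# Close round values used in the text: a = .5, r = 2/3.
rho_mid, eps_mid = solve_rho_eps(0.5, 2.0 / 3.0)

# Pass from mid-parental to individual-ancestor constants.
rho_ind = rho_mid * math.sqrt(2.0)
eps_ind = eps_mid * math.sqrt(2.0)

print(round(rho_mid, 4), round(eps_mid, 4))
print(round(eps_ind, 4), round(rho_ind, 4), round(1.0 - rho_ind, 4))
```

The last line sets ε beside 1 − ρ, the two quantities that would have to agree for condition (2) to hold.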


(8) It is not without interest to follow up what would happen in man and 
horse if, with our present numbers, we supposed the conditions for a stable popula- 
tion to be really satisfied. Let us select about the deviation h from the common 
mean for s generations, then the offspring would have a probable deviation from 
that mean of 

$$\epsilon\,\frac{1 - \rho^s}{1 - \rho}\,h.$$

If we then ceased to select, but bred the offspring among themselves for ¢ 
generations, the offspring of the tth generation would have a probable deviation 
from the mean of 


$$\epsilon\,\frac{1 - \rho^s}{1 - \rho}\,(\epsilon + \rho)^t\,h.$$


Let us turn these results into numbers, remembering that $\epsilon$ = .6244 and 
$\rho$ = .3184. 


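Result of selecting all ancestry for one generation           = .62h 
Result of selecting all ancestry for two generations          = .82h 
Result of selecting all ancestry for three generations        = .89h 
Result of selecting all ancestry for innumerable generations  = .92h. 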


In other words indefinitely long selection would only raise us to .92 of the 
desired character. We should therefore not get the whole of the desired character 
without selecting beyond it. But on the other hand by merely selecting for two 
generations we should get within ten, and for three generations within four per 
cent. of the possible maximum of indefinitely protracted selection. Hence selec- 
tion for a very few generations would raise the selection rapidly to within a small 
percentage of the maximum result. 


Generations after which selection is stopped and simple breeding-in of selected 
stock follows. 

                                       First    Second    Third    Innumerable 
Last offspring of selection            .62h     .82h      .89h     .92h 
First generation of in-bred stock      .59h     .78h      .84h     .86h 
Second generation of in-bred stock     .56h     .73h      .79h     .81h 
Third generation of in-bred stock      .52h     .67h      .74h     .77h 
Tenth generation of in-bred stock      .35h     .46h      .49h     .51h 





Next let us suppose selection stopped and the stock to breed-in or mate with 
its likes. The above table shows how it would slowly degenerate. The essential 
point to be noted is that while two or three generations of selection would carry us 
up to 80 or 90 per cent. of the desired character, ten generations following this of 
merely in-breeding without any selection would not have cost us 50 per cent. of 
the character so acquired. Quick influence of selection, slow effect of regression 
would be the result of combining the actually observed values with Mr Galton’s 
theory as to what should hold for a stable population. 
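
The figures for selection followed by in-breeding can be recomputed from ε = .6244 and ρ = .3184. The sketch below assumes the deviation after s generations of selection is ε(1 − ρ^s)/(1 − ρ) times h, and that each later in-bred generation multiplies it by (ε + ρ); the second assumption is a reading that follows when every ancestor of an in-bred generation carries the deviation of its own generation, and it is not legible as such in this copy. Function names are illustrative.

```python
EPS, RHO = 0.6244, 0.3184   # constants quoted in the text

def after_selection(s):
    """Probable deviation, as a fraction of h, after s generations of
    selection; s=None stands for indefinitely long selection."""
    if s is None:
        return EPS / (1.0 - RHO)
    return EPS * (1.0 - RHO ** s) / (1.0 - RHO)

def after_inbreeding(s, t):
    """Deviation after s generations of selection and then t generations
    of in-breeding, assuming a factor (EPS + RHO) per in-bred generation."""
    return after_selection(s) * (EPS + RHO) ** t

# Rows: generations of in-breeding; columns: 1, 2, 3, innumerable
# generations of selection.
for t in (0, 1, 2, 3, 10):
    row = [round(after_inbreeding(s, t), 2) for s in (1, 2, 3, None)]
    print(t, row)
```

Under these assumptions the rows for the last offspring of selection and for the first and tenth in-bred generations agree with the figures given. For two generations of selection followed by three of in-breeding the sketch gives .69 where the table reads .67; whether that reflects a slip in the source or a limit of this reading is left open.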


The reader must not forget that the illustration here given is absolutely 
hypothetical; the statistical constants obtained are deduced from material to 
which Mr Galton’s conditions hardly apply even as a rough approximation. Yet 
it is possible that something of the kind here indicated may occur in special 
cases. But if so, we ought to be very cautious of using vague categories in problems 
of heredity. If the mean tint of a seed, say, be yellow, and h would carry us 
well into the green end of the scale, .4h might still be green, and certainly for three 
and possibly for a good many more generations we might consider the stock arising 
from a single selection to be breeding true to itself, although actually it might be 
slowly regressing to the original tint of the early ancestry. It seems absolutely 
necessary in all such cases to have some colour standard and determine quantitatively 
whether successive generations do or do not tend to slowly approach or 
depart from it. The statement that ancestry has no influence might well be 
deduced by the use of a rough category, which would still class .62h with .52h. 


(9) Conclusions. 


(a) In all cases such as those of man, horse and dog, where parents of identical 
character do not produce identical offspring, the theory of statistics shows us that 
closer prediction may be obtained when we predict from many instead of few 
relatives. This follows from the consideration that all the heredity coefficients are 
positive. 

Attention is therefore properly paid to ancestry in such cases, and it is very 
misleading to suggest that any law of heredity can be universal which neglects 
ancestry. 

(b) The law of ancestral heredity in its most general form is not a biological 
hypothesis at all, it is simply the statement of a fundamental theorem in the 
statistical theory of multiple correlation applied to a particular type of statistics. 
If statistics of heredity are themselves sound the results deduced from this theorem 
will remain true whatever biological theory of heredity be propounded. 

(c) The law of ancestral heredity as founded on the theory of multiple 
correlation involves no biological theory of regression. The term regression has 
unfortunately been taken from statistical theory and interpreted in a biological 
sense. In statistics the regression is always to the mean of the forecasted character, 
but no assumption is made that this mean is identical with that of the foreknown 
character*. Further, if there be a number of cognates, we can a priori, 
i.e. before quantitative analysis, not state whether the total amounts they will 
contribute to the predicate will or will not indicate a biological regression*. 


* There is a “regression” for example if we predict breadth of skull from its length. I think it might 
be useful to adopt the word “predicate” for the forecasted and “cognate” for the foreknown character. 


(d) The law of ancestral heredity as a pure statistical statement has been 
supplemented by hypotheses which need verification and are semi-biological. These 
may (i) either be hypotheses as to the multiple regression coefficients, or (ii) as to 
the individual coefficients of correlation. 

Mr Galton has assumed that the former are the geometrical series ½, ¼, ⅛, ... 
I have assumed that the mean correlation coefficients for each ancestral generation 
form some geometrical series. It follows from my assumption that the regression 
coefficients would also form a geometrical series, but not necessarily Mr Galton’s. 


(e) In eye-colour in man and coat-colour in horses the mean ancestral 
coefficients of regression form within the limits of errors of random sampling a 
geometrical series, but it is not Mr Galton’s series: 

.5000, .2500, .1250, .0625, ..., 
but more nearly: 

.6244, .1988, .0630, .0202, .... 
In other words actual statistics show that in man and horse the parents are much 
more and the grandparents and higher ancestry less influential than on Mr Galton’s 
hypothesis. 

Thus the law of ancestral heredity (by which we are to understand the theory 
of multiple correlation together with the hypothesis that the mean ancestral 
correlations or the regression coefficients form a geometrical series) fits the data 
for horse and man remarkably well. 


(f) In man and horse we find the means of each generation differ, and 
further the variabilities of each generation differ. It is an assumption to suppose 
under these circumstances that the sum of the regression coefficients (or rather that 
part of them which we have represented by $J_1, J_2, J_3, \ldots$) is unity. 

Any geometrical series for the regression coefficients which satisfies the condition 
$\epsilon = 1 - \rho$ (as Mr Galton’s does) would give on the supposition of equal 
means and variabilities for each generation no regression whatever after a stock 
began to in-breed. 

If we may apply (which is very doubtful) our values for the J’s in man and 
horse to cases in which the means and variabilities of each generation remained 
the same, there would result the following principle : 

Two or three generations of selection would produce a stock of upwards of 90 
per cent. of the selected character, but no amount of selection, unless of a greater 
than the desired amount of character, would give us more than 92 per cent. of it. 
After selection ceases a very slow regression sets in, which would be hardly 
perceptible without very definite quantitative measurement for the first three or 
four generations of in-breeding. 


* It is curious that the original numbers selected by Mr Galton for the regression coefficients 
½, ¼, ⅛, etc. indicate no regression whatever towards the predicate mean, after the first generation, if 
the stock in-breeds or breeds with its likes. It is characteristic of how conceptions are misunderstood, 
that this point of “regression” is what the majority of biologists have seized as the one easily 
comprehended principle out of the whole of Mr Galton’s work! 


No rough classification like a Mendelian category would enable us to test 
whether such slow regression is really taking place, i.e. whether “recessives” are 
really breeding true to their stock*. And above all no Mendelian theory can 
(a) replace the statistical treatment unless it shows that offspring are absolutely 
determined by their parents, for if the correlation be not perfect, a better prediction 
must be obtainable by using the positively correlated ancestry, or (b) be proved 
without careful quantitative analysis of the variations falling within the wide 
categories adopted. 


* For example, if we selected extreme chestnut (towards roan-chestnuts) for two generations only in 
the horse, yet if the progeny in-bred for 10 generations, the offspring would still be well in the chestnut 
range of coat-colour, and no category like chestnut would tell us whether the horses were breeding true or 
regressing. 


Appendix I. Note on a paper by G. U. Yule. 


This paper was written before I was aware that Mr G. U. Yule was writing some account 
of the law of ancestral heredity, and was sent to press before I had seen his paper (The New 
Phytologist, November and December, 1902). With much of his paper I agree, for example, 
with his insistence on the point that the laws of intra-racial heredity are not incompatible 
with Mendelian principles holding for hybridisation. I should lay, however, far less stress than 
he does on the value of the existing evidence in favour of those principles. That evidence 
certainly justifies the making of new and crucial experiments, but these will have to be made with 
much greater caution and closer quantitative definition of the categories employed before we can 
say with certainty how far either of the two principles of dominance and segregation actually 
applies even to the cases of hybridisation already dealt with by the Mendelians. I shall be 
surprised if the laws which govern intra-racial and inter-racial heredity are not more closely 
related than appears to me possible under the recent formulation of Mendelian Principles by 
Mr Bateson. Further Mr Yule does not seem to me to have clearly expressed my personal 
position with regard to the law of ancestral heredity. I believe that my memoir of 1896 was the 
first in which the equations of multiple regression were worked out and applied to the problem of 
heredity. Such equations are, I presume, what Mr Yule refers to and says may be termed the 
law of ancestral heredity. But the objection to such form of the law is simply the impossibility 
of working it out for all the ancestry, and for all characters of all races. The problems that arise 
at once in our minds are: Is there any relationship between the correlations of offspring and 
successive ancestors? Are these correlations in cases, where direct selection of the characters 
is small or can be allowed for, the same or approximately the same for different characters of the 
same race, or for divers characters of different races? Is there any relation between the correlations, 
which appears fairly in accordance with observation and yet will simplify the otherwise 
appalling complexity of the multiple regression equations? The memoir of 1896 gave the 
general regression equations, and indicated what results would flow supposing one or other of a 
variety of hypotheses due to Galton, Weismann and others were true. The memoir of 1898 
adopted the simpler hypothesis that the correlation coefficients decrease in geometrical progression; 
it did not involve the fixity of the numerical constants of heredity which Mr Yule tells us has not 
stood the test of time. This simpler hypothesis, discussed in the present paper, still seems to me 
to stand the test of time. Lastly, I think, the reader of Mr Yule’s paper would believe from the 
words : “Selective mating, natural selection, reproductive selection, the effect of circumstance, 
had all in turn to be recognised as causes affecting the values of the constants of heredity,” that 
these factors were introduced post hoc, i.e. after observation had failed to give fixity of constants, 
whereas almost the reverse is the true case. The definition of selective mating was given in my 
first paper to the Royal Society on biparental inheritance in 1895* before I had applied the 
formulæ to actual data at all. Reproductive selection was considered in 1896†, and the influence 
of natural selection on correlation (including the coefficients of heredity) was given in College 
Lectures of the session 1896-7, and put into a form identical with that of the memoir just issued‡ 
in August 1897. The memoir of 1898 on the law of ancestral heredity was written with a 
knowledge of the influence of the three factors of selective mating, natural selection and repro- 
ductive selection on the constants of heredity. As a matter of fact they had been biometrically 
defined in or before the memoir of 1896, and were not “all in turn recognised as causes affecting 
the values of the constants of heredity,” or used to bolster up some theory which had not stood 
the test of time. 


What I felt in 1896, I still feel, namely: that the problem of heredity from the purely statistical 
standpoint is an extremely difficult one, that the factors of selective mating, natural selection, 
reproductive selection, environmental influence are extremely complex, especially when we place 
plants or insects in very artificial surroundings for the purpose of experimenting on their laws of 
reproduction. Still these factors do not invariably tend in one direction, and when we take long 
series, as free from their influence as is practically possible, we shall be able to judge from the 
clustering of the heredity coefficients round certain values—a phenomenon which I think is 
obvious in existing results—what in broad lines is the quantitative intensity of heredity for 
different relationships, and for different characters in different species. Mr Yule indicates that 
he has a theory which allows him to determine the influence of the environmental, reproductive, 
selective and mating factors; I can only say that I, as well as most biometricians, will heartily 
welcome its publication, if it not only allows, in the case of any heredity statistics which it is
feasible to collect, for the correcting of the heredity constants for all these influences, but achieves
this by some hypothesis which is à priori as reasonable, and à posteriori as justified as the simple
one that ancestral correlations diminish in a geometrical progression.


Appendix II. On Inheritance (Grandparent and Offspring) in
Thoroughbred Racehorses.


By NORMAN BLANCHARD, B.A.


(1) At the suggestion of Professor Karl Pearson I have recently worked out the remaining
four cases for the inheritance of coat-colour in thoroughbred horses, viz. those between 
foals—colts or fillies—and their paternal and maternal grandams. The work was done in a 
somewhat different manner to that on the four earlier tables. In order to consider the bearing 
of coat-colour inheritance on Mendel’s theory, actual colour pedigrees were formed going back to 
the great-great-grandparents and in some cases to the fifth and eighth generation of ancestry. 
Further, in order to get more striking colour contrasts than are now-a-days possible, the first 
three volumes of the stud-book were dealt with. I thus went back to the earliest records of coat
colour in thoroughbreds. Here we find 2 to 3 per cent. of black horses and 9 to 10 per cent. of 

* “On Regression and Inheritance in the case of two Parents,” R. S. Proc. Vol. 58, 1895. 
† “On Reproductive Selection,” R. S. Proc. Vol. 59, p. 301, 1896.
‡ Phil. Trans. Vol. 200, pp. 1—66, 1902.
greys, while in the recent studbooks these percentages are hardly 2 per cent. and .5 to 1
per cent. respectively. In the early colour records the finer distinctions, e.g. “black or
brown,” “bay or chestnut,” etc., of the modern terminology rarely appear, but this is not
of importance. A thousand cases were taken for each series, and the correlation Tables I. to IV.
were constructed as in Pearson and Bramley-Moore’s memoir on Inheritance of Coat-Colour
in Horses*; in fact the method there used was followed identically, the fourfold division between
chestnut and bay being used for the short tables. The following results were obtained, the
notation being that of Professor Pearson’s memoir: On the Correlation of Characters not
quantitatively measurable†.


(2) Table I. reduces to:

                                  Paternal Grandams.
                            Bay and     Chestnut and
                            darker      lighter         Totals
Colts.
  Bay and darker             486          188             674
  Chestnut and lighter       172          154             326
  Totals                     658          342            1000

This gives:

h = .4070105,    H = .3672283,
k = .4509851,    K = .3603667,

and the equation:

$$.3212093 = r + .0917778\,r^2 + .1107746\,r^3 + .0606283\,r^4
  + .0308582\,r^5 + .0443421\,r^6 + .0101909\,r^7 + .0339243\,r^8 + \text{etc.},$$

the root of which is

r = .3085.

From Table II. we have:

                                  Maternal Grandams.
                            Bay and     Chestnut and
                            darker      lighter         Totals
Colts.
  Bay and darker             539          149             688
  Chestnut and lighter       198          114             312
  Totals                     737          263            1000

giving:

h = .6341100,    H = .3300624,
k = .4901880,    K = .3537792,

and the equation:

$$.2735648 = r + .1554165\,r^2 + .0631895\,r^3 + .0928545\,r^4
  + .0100882\,r^5 + .0608670\,r^6 - .0007343\,r^7 + .0412678\,r^8 + \text{etc.},$$

the root of which is

r = .2614.

* Phil. Trans. A, Vol. 195, pp. 92 et seq.
† Phil. Trans. A, Vol. 195, pp. 1—47.
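The constants quoted for Table I. can be re-derived from the four cell frequencies. The following is a minimal sketch, assuming that h and k are the standard normal deviates corresponding to the first-column and first-row proportions, that H and K are the normal ordinates at those deviates, and that the left-hand side of the series is (ad − bc)/(N²HK). The function name `fourfold_constants` is illustrative only and does not come from the memoir.

```python
from statistics import NormalDist


def fourfold_constants(a, b, c, d):
    """Return (h, k, H, K, lhs) for the fourfold table [[a, b], [c, d]].

    h, k : normal deviates for the first-column and first-row proportions
    H, K : standard normal ordinates at h and k
    lhs  : (ad - bc) / (N^2 * H * K), the left-hand side of the series in r
    """
    n = a + b + c + d
    std = NormalDist()                 # standard normal, mean 0 and sd 1
    h = std.inv_cdf((a + c) / n)       # first-column margin, 658/1000 here
    k = std.inv_cdf((a + b) / n)       # first-row margin, 674/1000 here
    H, K = std.pdf(h), std.pdf(k)
    lhs = (a * d - b * c) / (n * n * H * K)
    return h, k, H, K, lhs


# Colts (rows) against paternal grandams (columns), cell values from Table I.
h, k, H, K, lhs = fourfold_constants(486, 188, 172, 154)
print(f"h={h:.7f}  k={k:.7f}  H={H:.7f}  K={K:.7f}  lhs={lhs:.7f}")
```

Under these assumptions the printed values should agree with the h, k, H, K and left-hand side given for Table I. to about four decimal places.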


From Table III.:

                                  Paternal Grandams.
                            Bay and     Chestnut and
                            darker      lighter         Totals
Fillies.
  Bay and darker             525          189             714
  Chestnut and lighter       175          111             286
  Totals                     700          300            1000

giving:

h = .5243986,    H = .3517190,
k = .5689150,    K = .3393338,

and the equation:

$$.2111434 = r + .1491612\,r^2 + .0817246\,r^3 + .0906581\,r^4
  + .0138143\,r^5 + .0606133\,r^6 + .0014633\,r^7 + .0420798\,r^8 + \text{etc.},$$

the root of which is

r = .2041.


From Table IV.:

                                  Maternal Grandams.
                            Bay and     Chestnut and
                            darker      lighter         Totals
Fillies.
  Bay and darker             542          163             705
  Chestnut and lighter       187          108             295
  Totals                     729          271            1000

giving:

h = .6097900,    H = .3312572,
k = .5388334,    K = .3380886,

and the equation:

$$.2505039 = r + .1750152\,r^2 + .0701968\,r^3 + .0976633\,r^4
  + .0088602\,r^5 + .0630885\,r^6 + .0001216\,r^7 + .0419921\,r^8 + \text{etc.},$$

the root of which is

r = .2392.
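The roots quoted in section (2) can be checked numerically. The sketch below assumes that the series is cut off after the r⁸ term (the "etc." is dropped) and uses simple bisection on the interval 0 to 1, which is adequate when all the retained coefficients are positive, as they are for Tables I. and III. The function names are illustrative, and bisection is merely one convenient method, not necessarily the one used in the original computation.

```python
def series_value(r, coeffs):
    """Evaluate r + c2*r^2 + c3*r^3 + ... for coeffs = [c2, c3, ...]."""
    total, power = r, r
    for c in coeffs:
        power *= r
        total += c * power
    return total


def solve_series(lhs, coeffs, tol=1e-10):
    """Find r in [0, 1] with series_value(r) = lhs by bisection.

    Assumes the series is increasing on [0, 1], which holds when every
    coefficient is non-negative.
    """
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if series_value(mid, coeffs) < lhs:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


# Coefficients of r^2 ... r^8 as printed for Tables I. and III.
table_i = [.0917778, .1107746, .0606283, .0308582, .0443421, .0101909, .0339243]
table_iii = [.1491612, .0817246, .0906581, .0138143, .0606133, .0014633, .0420798]

r_i = solve_series(.3212093, table_i)
r_iii = solve_series(.2111434, table_iii)
print(f"Table I.:   r = {r_i:.4f}")
print(f"Table III.: r = {r_iii:.4f}")
```

The neglected higher terms are small for correlations of this size, so the truncated series should reproduce the quoted roots to about the fourth decimal place.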


(3) The eight grandparental correlations for the horse are given below together with the
corresponding ones for eye-colour in man for the sake of comparison. The whole series has now
been determined by Pearson and Bramley-Moore* or myself†.

                                             Horse.          Man.
                                             Coat-colour     Eye-colour
  1   Pat. Grandfather and Son ...              .3238           .4213
  2   Pat. Grandfather and Daughter             .3609           .3802
  3   Mat. Grandfather and Son ...              .3590           .3717
  4   Mat. Grandfather and Daughter             .3116           .2969
  5   Pat. Grandmother and Son ...              .3085           .2722
  6   Pat. Grandmother and Daughter             .2041           .2205
  7   Mat. Grandmother and Son ...              .2614           .2623
  8   Mat. Grandmother and Daughter             .2392           .3180

      Mean ...                                  .2961           .3179

It will be seen at once that the mean result for men is substantially in agreement with that
for horses. We can hardly suppose grandparental correlation for the characters in man and
horse dealt with to differ much from .3. The mean value of grandparental correlation in
the horse for cases 1 to 4 drawn from the modern studbooks is .3388 and for cases 5 to 8 drawn
from the older records .2533. I believe this is due to the close relationship of so many of
the horses in the earliest period of breeding. The difference may also to some extent depend
on the somewhat rougher appreciation and description of colour. If there be any influence
of change of sex in the intensity of heredity for coat-colour in the horse, it is not manifested in
the above statistics, which thus differ widely from those of eye-colour in man.

* Loc. cit. p. 93.
† Biometrika, Vol. I. p. 361.


TABLE I.


Colts and Paternal Grandams.

[Rows: Colts. Columns: Paternal Grandams. The body of this table is not recoverable from the source.]
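The means quoted in section (3) follow from the eight coefficients of each column. A minimal check, with variable names chosen here purely for illustration, might read:

```python
from statistics import mean

# Grandparental correlations in the order of cases 1 to 8 of section (3).
horse = [.3238, .3609, .3590, .3116, .3085, .2041, .2614, .2392]
man = [.4213, .3802, .3717, .2969, .2722, .2205, .2623, .3180]

horse_mean = mean(horse)          # all eight cases
man_mean = mean(man)
modern = mean(horse[:4])          # cases 1 to 4, modern studbooks
older = mean(horse[4:])           # cases 5 to 8, older records

print(f"horse mean  = {horse_mean:.4f}")
print(f"man mean    = {man_mean:.4f}")
print(f"cases 1-4   = {modern:.4f}")
print(f"cases 5-8   = {older:.4f}")
```

The gap between the two halves of the horse series (cases 1 to 4 against cases 5 to 8) is the difference discussed in the text.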


TABLE II. 


Colts and Maternal Grandams. 


Maternal Grandams. 
Colts. | bl. | bl./br. | br./bl. | br. | br./b. | b./br. | b. | b./ch. | ch./b. | ch. | ch./ro. | ro./ch. | ro. | ro./gr. | gr./ro. | gr. | Totals
| bl. oe RS eee, i fee | ma | - Eh De Pe | ee ee 
| bl./br. — a _ = = -{—] — — —s aa Ss a ee ae oe = 
br.fobl. f—| — | - —|— - 1 -- 1}; — = = — 2 | 
| br. Le a eee | — | s8/ — | — j17} — 1 i a — | 20/8) 165 | 
Co el RR i tes i ee es i ee es pe ee, Oe) hice al 
| b./br a a. res ee, A cls —| — anes oe aks eee << fh See ce 
| b. 6| — — |}%3| — — |300; — | — | 7%| — r |f—| — — | 35] 500 
| b./ch. = — —- — — — — — = ns — = = — aan | 
| ch,/b —_— | —_— —- — — | — — — meat — oa. — 8 =r | 
ch. 3 — — 29 -- _ 132; — — 58 -- — 2 —- — 14 238 | 
| ch./ro. | — — | — |—-| -— en ae est, a = = a 
| ro/ch, J — | — | — — — |}—j — — t—f aca (y= se —|-} —| 
| ro. — | a 1 — — 1} —|-— 1 | a — 1 — = 1 5 | 
|ro/gr, f—| — | — — |—}j — =f a ey Se Gee 
- aS Sees ae 
| st 2}—]}—|7/—]—|2}/—|—|7]/—] —|-| —]| — |ao] 6] 
| Totals J 14) — | — [150] — | — | 573 — | _ | 164| = | 2 | 4 — “7 93 | 1000 | 
TABLE III. 
Fillies and Paternal Grandams. 
Paternal Grandams. 
Fillies. | bl. | bl./br. | br./bl. | br. | br./b. | b./br. | b. | b./ch. | ch./b. | ch. | ch./ro. | ro./ch. | ro. | ro./gr. | gr./ro. | gr. | Totals
bl. —|—|—j;,5)—|]—]a}—|—-—] 9} —| —|-| —|] — | 19 2] 
eS Se ee ee ee ee ee eee eo ee ee ee ee 1| 
brbl. J—| — | — = = 1 —}-—|-—-|]-|]-|-|-|- 1 | 
br. fe 38 — | 83); — — 31 —— — — — — | & 166 | 
| brjb, F— | — — }— eo ee eee Fe -_f =— Jape | = fe 1 
b./br —| — -— — - }— | — — oe -— a _ — - — |j— - | 
|b. 5) — - |91} — | — Jao} — | - {109} — | — |—]| — | — | 82] 517] 
| b./ch. =| — me ss Se eee es ae }; — |—] - n= —|— — |— - | 
a i—| — -}—-| — | —|—| —| — J=] -—} -jJ-| -|-J/-] =) 
| ch. 2; — | — |] — | — jn3} — —- | 6; — | — j|—}] — | — | 14] 214 | 
| ch./ro —|— — — | — |— sins 7 = oe ics —{|—-—|- 1 
ro./ch —|— _ Sa —_ =) oa —i—) — | — |—F — 
| ro. —/—];1/—]—] 44-]-J-]-|]-J-]- | sae, | Daa 5 
| ro./gi —|—- -- -- — | —|{|— | — |j—-| — _ —-|— — |—- — 
| gr./ro — - _— | = | —_ — =: ate fw: ae nee — = — ee idk 
gr. 1} — | — ] 12] — — =) = eS | le — |] 66 
| | | | 
| Totals J 26] — | — {172} — | — [502] — | — jam! — | — |—| — | — |2 1000 | 
TABLE IV.
Fillies and Maternal Grandams.
Maternal Grandams.
Fillies. | bl. | bl./br. | br./bl. | br. | br./b. | b./br. | b. | b./ch. | ch./b. | ch. | ch./ro. | ro./ch. | ro. | ro./gr. | gr./ro. | gr. | Totals
bl. — | — 4 | —_ 9; — — | 5 — —_ —- | - — 6 25 
bl. /br. —{—{—-}|-—|-J-|}-}]-y1t}—-—|-j-|-|- {- 1 
br./bl. J—| — | — | -—-| — — {—|, — | — iti - —j-|] — — |= 1 
br. 1} — | — |33] 1 | — | 84] — | — |} — | — | 1] — | — | 7] 186 
bib F—|/ — | —|—-| —} —! 4) —]} —}—-| —| -—J}-| -| -I- 1 
b./br. — —_ — — _ — |— — — |— Te — —|{|— — —_ 
b. 6; — | — |so| — | — {322} — | — | 7%); — | — |—| — |] — | 40 | 521 | 
bjch. F—| — — j—-|] - — |j-| — —}j-| —- —|—| — —i—ai — 
ch./b —| — | — j-| —- — a _ — fw) = a ital WR — |—4j — 
ch. 2} — | — jiz7j — | — |iss} — | — joo) — | — |—| — | — | 15] aaa 
| ch./ro. | — — | — |— | — — |j—|] — —|—-} — == at —- |j—]f — 
ro./ch. — —- | — — _— — | —_ — — |j-—-| — — — — —_— j— — 
ro. ee ee | —-|—]}]2—;—|]—-| - - —|}—|{-— 3 
ro./er. _ — — | — - — - — — —_— — — — - 
5 | | 
gr./ro. J — | — — — _ - —| — —-— | =i — — oe. as = — | 
gr. 1} = 9) — -~j|i7j} — —| 8| —| — |—| — | — | 25] @ 
Totals} 11) — | — j144) 1 | — 573! — | — a7} — | — | 1] — | — | 98] 1000 
Appendix III. On Inheritance (Great-grandparents and Great-great- 
grandparents and Offspring) in Thoroughbred Racehorses. 
By ALICE LEE, D.Sc.
From Mr Blanchard’s racehorse coat-colour pedigrees, I have, paying no attention to sex, 
been able to extract 1155 cases of great-grandparent and offspring and 978 cases of great-great- 
grandparent and offspring. When it is noted that there are 16 types of great-grandparental and 
32 types of great-great-grandparental relationship, so that 48 correlation tables would be required 
for the full working out of these cases, it will be understood why in this preliminary study I have
not differentiated between the sexes.
Tables I. and II. reproduce my data.
From Table I. I find in the fourfold division hitherto adopted:


                                  Great-Grandparents.
                            Bay and     Chestnut and
                            darker      lighter         Totals
Offspring.
  Bay and darker             601          264             865
  Chestnut and lighter       166          124             290
  Totals                     767          388            1155

Hence

h = .671102,    k = .423602,

and the equation is:

$$.198115 = r + .141813\,r^2 + .075166\,r^3 + .085182\,r^4 + \text{etc.},$$


giving as root a value which in round numbers is r = .20.


From Table II. we have:

                                  Great-great-Grandparents.
                            Bay and     Chestnut and
                            darker      lighter         Totals
Offspring.
  Bay and darker             497          252             749
  Chestnut and lighter       130           99             229
  Totals                     627          351             978

I find

h = .725258,    k = .361420,

and the equation

$$.149992 = r + .131061\,r^2 + .068681\,r^3 + .077532\,r^4 + \text{etc.},$$

which gives

r = .1469.

We see from these results that the ancestral relationships in coat-colour for the horse are far
more intense than was originally supposed for the like grades of relationship. We have in
round numbers:

Parental correlation ... ... ... ...         = .50.
Grandparental correlation ... ...            = .30.
Great-grandparental correlation ...          = .20.
Great-great-grandparental correlation        = .15.

As far as this series goes, not the coefficient itself, but the difference appears halved at each
ascent, and it is clear that the 1/2, 1/4, 1/8, 1/16, ... series cannot possibly hold for the regression
coefficients of the multiple correlation series.
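The remark about halving refers to the successive differences of the round-number correlations, not to the correlations themselves. A short sketch of the arithmetic follows; the parental value of .50 is taken here as the figure that the halving rule implies, and the variable names are illustrative.

```python
# Round-number correlations: parental, grandparental,
# great-grandparental, great-great-grandparental.
correlations = [.50, .30, .20, .15]

# Successive differences between one grade of ancestry and the next.
differences = [round(a - b, 2) for a, b in zip(correlations, correlations[1:])]

# Ratio of each difference to the one before it.
difference_ratios = [round(d2 / d1, 2)
                     for d1, d2 in zip(differences, differences[1:])]

# Ratio of each coefficient to the one before it, for contrast.
coefficient_ratios = [round(b / a, 2)
                      for a, b in zip(correlations, correlations[1:])]

print("differences:        ", differences)
print("difference ratios:  ", difference_ratios)
print("coefficient ratios: ", coefficient_ratios)
```

The difference ratios are constant at one half, whereas the coefficient ratios are not, which is the contrast drawn with a series in which the coefficients themselves are halved at each ascent.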
TABLE I.
Great-Grandparents and Offspring.
Great-Grandparents.
Offspring. | bl. | bl./br. | br./bl. | br. | br./b. | b./br. | b. | b./ch. | ch./b. | ch. | ch./ro. | ro./ch. | ro. | ro./gr. | gr./ro. | gr. | Totals
bl. 4| — — |17}; — — | 4; — — | 37); — ai — 2 96 
eS ee ee ee es es ee en ey ae eee eae, ey Pee ee ee es 
sh Sa ee eee ey ee Fs as Gar | ak ey Re, fe, es ne 
br. 6}; — — |}3| — — 97; — — eo -—| =— - — — 9] 196 
br./b. _ —_ — = —_ —_— | — — ae _— —_— - — — — 
b./br -;};— — —|;— — j;—| — — | } — — j—|] — —_ |}— 
b. 7] — | — jas] — | — |ao7] — | — [ro] — |] — [-—] - - | 98] 573 
bjch. J—| — — |}—}] — — i—}] — — | | — - |— - — |— — | 
ch./b — — |—| — — |}—/| — — | —| — — |—| — — |—f — | 
ch. 1 — os 34 — — 103; — — 75 -- - _ —- — 22 235 
ch./ro. — — — — — — — = — _ — —j — 
ro./ch. | — —_ — - ~- — — — — —|— _ — — = —~ - 
ro. SS ne es 8 ee eee ae ee eee i= oe 2] 16 
ro./gr. | — — — — — - = _ —_— — — — —- — 
gr./ro. — — —-| — —- |— _ — };—| — -- _ _ - |— - 
gr. - -- — 5 — = 13 Wo ae 8} - — - 39 
Totals | 28 — | — |175} -— — |564 - - |317; — - — _ — 71 f 1155 

TABLE II.
Great-great-Grandparents and Offspring.
Great-great-Grandparents.
Offspring. | bl. | bl./br. | br./bl. | br. | br./b. | b./br. | b. | b./ch. | ch./b. | ch. | ch./ro. | ro./ch. | ro. | ro./gr. | gr./ro. | gr. | Totals

bl. 2 = - 14 . | = 40 | | — | ei — Se a ee = 7 87 
bl./br. | — ee - |— = a cee - —| — =e 
br./bl. | — OT a ee, eee a ee es eee = = , oe. a 
br. bi. ~ -| — | oo} — | — | 43) — Se 6] 142 
br./b — - | — - — | - — - — 
b./br. — = ~ . | — |- _ ~ ~ — — = as 
b. 5 — 70 —- | — |273 143 — -= 29 | 520 
b./ch — — — i—|i— — _- —_ —_— —_ — = - 
ch./b. | — - — s2ek oe 8 eof eee — |— —- —|— — —-j— 
ch. 1 -| — )l7] — | — | 89} — Mi w }-s = ee - | 19] 187 
ch./ro. | — — — . - } — — - — — - - -- — — _- 
ro./ch. | — — — |-— — = - - —|— —|— — - 
ro. ;— 2 — |- 7 - 5; — = — — l 15 
ro./gr. — ~ - - — — —} — 
gr./ro _ - - = bs - — -j — 
gr. = — 6 - _ 8 10 — — a 3 27 
Totals J 11) — | — |130) — 486) — - oe — | mm | -| — | — | 6] 978 








MISCELLANEA. 


Note on the Influence of Change in Sex on the Intensity of Heredity. 
By FRANK E. LUTZ, M.A. 


(1) In Professor Karl Pearson’s memoir, On the Inheritance of Eye Colour in Man* it 
is stated on page 117 “‘that the younger generation is more highly correlated with an ascendant 
or collateral of the same than of the opposite sex,” and again page 115, “that change of sex 
weakens the intensity of heredity.” It seemed worth while pursuing this point further and 
grouping together all the available evidence as far as eye-colour in man is concerned. Accord- 
ingly, Mr Galton having again most kindly put his eye-colour data at Prof. Pearson’s 
disposal, I have worked out the correlation between great-grandparents and offspring, which
completes as far as ancestral correlation is concerned the whole of the available records of this 
sort. I hope shortly to examine in the same way a large mass of data which I have gathered 
concerning coat-colour in dogs of the Pointer breed. 


I use the following scheme of notation for pedigree:

                 0♂   1♀                              Offspring
         2♂                  3♀                       Parents
     4♂      5♀          6♂      7♀                   Grandparents
   8♂ 9♀  10♂ 11♀     12♂ 13♀  14♂ 15♀                Great-grandparents

This only differs from Mr Galton’s widely accepted scheme in the introduction of 0 for the
male offspring to distinguish them from 1, the female offspring. $r_{s,s'}$ will mark the correlation
between a pair of relatives one of whom is in class s, the other in class s′; and a reference to
the above scheme will at once show us which pair of relatives is being dealt with. Thus $r_{0,12}$ is
the correlation of male great-grandchildren with the father of the maternal grandfather, and
so on.
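The notation lends itself to a mechanical count of the changes of sex along any line of descent. The sketch below assumes, as the scheme indicates, that even numbers denote males and odd numbers females, that classes 2 and 3 are the parents of either offspring class, and that for class 4 and upwards the child of classes 2n and 2n + 1 is class n. The function names are illustrative.

```python
from collections import Counter


def line_of_descent(offspring, ancestor):
    """Return the classes from the ancestor down to the offspring."""
    chain = [ancestor]
    node = ancestor
    while node >= 4:
        node //= 2              # classes 2n and 2n + 1 are the parents of n
        chain.append(node)
    chain.append(offspring)     # classes 2 and 3 are parents of 0 and of 1
    return chain


def sex_changes(offspring, ancestor):
    """Count changes of sex along the line (even = male, odd = female)."""
    chain = line_of_descent(offspring, ancestor)
    return sum(a % 2 != b % 2 for a, b in zip(chain, chain[1:]))


# Group the 16 great-grandparental relationships by number of sex changes.
groups = Counter(sex_changes(o, a) for o in (0, 1) for a in range(8, 16))
print(dict(sorted(groups.items())))      # {0: 2, 1: 6, 2: 6, 3: 2}
print(line_of_descent(0, 12))            # [12, 6, 3, 0]
```

For example, `sex_changes(0, 12)` counts the changes between a male great-grandchild and the father of his maternal grandfather, the relationship denoted by $r_{0,12}$.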


(2) Unfortunately the data for great-grandparents is not very extensive. We had altogether 
only 2996 cases, or about 187 for each of the 16 possible great-grandparental correlations. This 
is distinctly too few considering the magnitude of the probable errors which occur when we deal 
with quantities not quantitatively measurable†. Accordingly I paid no attention to the
separation of sex in the offspring but grouped my great-grandparental data into four classes : 
(a) those in which there is no change of sex between great-grandparent and offspring, (b) those 
in which there is one change of sex, (c) those in which there are two changes, (d) those in which 


* Phil. Trans. A, Vol. 195, p. 102 et seq. 
† Phil. Trans. A, Vol. 195, p. 42.

there are three changes. I further classed Professor Pearson’s constants for parental and
grandparental data* in the same manner, taking the average of the coefficients obtained by him 
for each class. 


The results are as follows:

Parental Inheritance.
Mean $r_{0,2}$, $r_{1,3}$ = .5300 ... ... ... ... ... No change of sex.
Mean $r_{0,3}$, $r_{1,2}$ = .4094 ... ... ... ... ... One change of sex.

Grandparental Inheritance.
Mean $r_{0,4}$, $r_{1,7}$ = .3697 ... ... ... ... ... No change of sex.
Mean $r_{0,5}$, $r_{1,4}$, $r_{1,6}$, $r_{0,7}$ = .3029 ... ... ... One change of sex.
Mean $r_{0,6}$, $r_{1,5}$ = .2961 ... ... ... ... ... Two changes of sex.

Great-grandparental Inheritance.
†Groups 8 and 0, 15 and 1 give r = .3471 ... ... ... No change of sex.
Groups 9 and 0, 11 and 0, 15 and 0, 8 and 1, 12 and 1, 14 and 1 give
r = .2217 ... ... ... ... ... ... ... One change of sex.
Groups 10 and 0, 12 and 0, 14 and 0, 9 and 1, 11 and 1, 13 and 1 give
r = .1452 ... ... ... ... ... ... ... Two changes of sex.
Groups 13 and 0, 10 and 1 give r = .0375 ... ... ... Three changes of sex.
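The grandparental means can be reproduced by grouping the eight eye-colour coefficients according to the number of changes of sex. The sketch below assumes that the coefficients averaged are the eye-colour values for man quoted in Appendix II. above, keyed here by (offspring class, grandparent class) in the pedigree notation of section (1); the dictionary and function names are illustrative.

```python
from statistics import mean

# Eye-colour correlations for man, keyed by (offspring class, grandparent class).
grandparental = {
    (0, 4): .4213, (1, 4): .3802,   # paternal grandfather with son, daughter
    (0, 6): .3717, (1, 6): .2969,   # maternal grandfather with son, daughter
    (0, 5): .2722, (1, 5): .2205,   # paternal grandmother with son, daughter
    (0, 7): .2623, (1, 7): .3180,   # maternal grandmother with son, daughter
}


def sex_changes(offspring, grandparent):
    """Changes of sex from grandparent to offspring (even = male, odd = female)."""
    parent = grandparent // 2       # 4, 5 -> father (2); 6, 7 -> mother (3)
    sexes = [offspring % 2, parent % 2, grandparent % 2]
    return sum(a != b for a, b in zip(sexes, sexes[1:]))


groups = {}
for (o, g), r in grandparental.items():
    groups.setdefault(sex_changes(o, g), []).append(r)

group_means = {n: mean(rs) for n, rs in sorted(groups.items())}
for n, m in group_means.items():
    print(f"{n} change(s) of sex: mean r = {m:.4f}")

# Mean of the four great-grandparental coefficients listed above.
great_mean = mean([.3471, .2217, .1452, .0375])
print(f"great-grandparental mean = {great_mean:.4f}")
```

On these inputs the three grandparental group means fall steadily as the number of changes of sex rises, in the same order as the great-grandparental coefficients.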
The mean result of the four great-grandparental coefficients here found is .1879, or if all the
material be clubbed together and one correlation table formed for it, it is .1824, the two
results being practically identical, considering the probable error of their difference. At the
end of this note I give the actual correlation tables, remarking that the classes are those of
Mr Galton’s eye-colour classification, i.e.
1 = Light blue.              5 = Light brown.
2 = Blue, dark blue.         6 = Brown.
3 = Grey, blue-green.        7 = Dark brown.
4 = Dark grey, hazel.        8 = Very dark brown, black.
For purposes of calculation the fourfold table was formed by classing together 1-3, and 4-8.
Now, although the probable error of these results runs from .033 to .055, their invariable
decrease with an increasing number of changes of sex is highly significant. It can hardly
be said that there is a constant factor of reduction for each change of sex ; but, if it existed, a 
more elaborate system of measurements would probably be required to satisfactorily determine 
it. Still we can conclude with absolute safety for eye-colour in man and probably therefore for 
many other characters in other forms of life that : 


Every change of sex in the line of ancestry sensibly weakens the intensity of inheritance. 


It is desirable accordingly in future heredity experiments to record carefully such changes 
for they can obviously be of much significance. We may even see in them some justification 
for refusing to admit inheritance of title through the female line, although the objection would 
apply equally to the transfer from the female through the male line. 


* Phil. Trans. A, Vol. 195, p. 106. 


† The results here are obtained, not by taking the means of correlation coefficients, but by clubbing
groups together. 





TABLE I.—No Change of Sex.
Offspring.
| = 
1 | eis|4lslel|7|s Totals| 
i i 
’ Offspring. 
1 |—|a/-|—|-|—|-|-] 2] ¢ so 
2 | 1 | 51| 35} 29] 2)15| 7/5 7145 | 5 | ol, ala| 
3 75 | 33) 31 | 13 | — | 6 | 13 101 | = | lle ati putale) 
—| 7] 4] 91/1] 413 |—f4 98 os l 
S ph Bd hd BES ee oe | 1-3] 157 | 49 | 206 | 
6 |—| 7] 8] 6|3|10\/9|—] 4 Saad Bed Mid endl 
7 J—| 8] 8] 7T]/—] 2] & 1 BF as 2 | 
8 1 2 24) Se St oe ee 12 | 2 | Totals} 247 121 368 
| | S - 
\Totals| 7 | 109 | 90 | 67 | 6 | 37 | 40 | 12] 368 | 
*h = .44322, k = .15042,
$$.02319\,r^4 + .13090\,r^3 + .03333\,r^2 + r - .35688 = 0,$$
r = .3471.
TABLE II.—One Change of Sex.
Offspring. 
| | | a ae 
1|2|3|4|5)|6| 7 | 8 JTotals 
Offspring. 
1 4 ae 8 5 | — 1} —|— 21 2 eine ea , 
2 8 |177| 95 76) 5 | 39} 31/17] 448 3 1—3 | 4—8 | Totals) 
3 1 | 69) 85 52) 2) 20) 26) 1] 256 I 
He Bod Bes Big Gad al MR ri fe’ Z| 1-3] 450 | 275 | 725 
Tk aad Galt aad at ae ee & | 4-8] 185 | 203 | 388 | 
6 | 2, 37| 27 17| 3| 30] 20| 4] 140 Ahmed! Bead Bd Bell 
’ — | 15] 20; 24] 3 4; 9| 9 84 y= | 
8 a 10) 13| 19 2 2 7 5 51 3 | Totals 635 478 1113 | 
| { rr ! a 
Totals] 21 | 345 269 | 213| 17 | 103 | 108 | 37 | 1113 
h = .38909, k = .17772,
$$.02436\,r^4 + .13697\,r^3 + .03458\,r^2 + r - .22496 = 0,$$
r = .2217.
TABLE III.—Two Changes of Sex.
Offspring. 
1|}2\|3|4|5161|7 | 8 JTotals 
: Offspring. 
| 2 | 5] 4!) 9| 1] 2 } 1| 3] 2% 2 Bae. 
| g 6 |175| 86) 80); 4] 27] 34] 12] 424 3 _@| 4-9 Irotale 
| 3 | 3} 68| 94] 57| 2/27/27] 4] 289 Z ~? | +s 
4 i 28} 2 : 5 : 90 = : 
| > Ee Bind Sart voy Bary endl (es | ie 2 | 1-3] 450 | 981 | 731 
| 6 | 1] 34) 30] 34] 1/20/14] 5] 130) &| 4%] 208 | 190 | 398 | 
| 7 1 37] 34| 25; 1/16/14}; 4] 132 3 |= 
| 28 1 4) a “Si = 4 | 7 2 32 S |Totals] 658 471 | 1129 | 
o} BTS Sh hae 
Totals] 22 | 354 | 282 217 | 13 | 99 | 111| 31 | 1129 | 
= en Meee Pee Pee: Se 2 ; | 
h = .37852, k = .20911,
$$.02785\,r^4 + .13654\,r^3 + .03958\,r^2 + r - .14642 = 0,$$
r = .1452.
* The notation is the same as that used by Pearson, Phil. Trans. A, Vol. 195, pp. 1—47. 





TABLE IV.—Three Changes of Sex.

                               Offspring.
                          1—3       4—8      Totals
Great-grandparents.
  1—3                     141        89        230
  4—8                      92        64        156
  Totals                  233       153        386

[The eight-class table is not recoverable from the source. The surviving great-grandparental totals for classes 1 to 8 are 8, 123, 99, 37, 3, 48, 64 and 4, making 386 in all.]

h = .24264, k = .26275,
$$.02290\,r^4 + .14603\,r^3 + .03188\,r^2 + r - .03758 = 0,$$
r = .0375.


TABLE V.—Total Material.

                               Offspring.
                          1—3       4—8      Totals
Great-grandparents.
  1—3                    1198       735       1933
  4—8                     534       529       1063
  Totals                 1732      1264       2996

[The eight-class table is not recoverable from the source.]

h = .19705, k = .37238,
$$.02591\,r^4 + .13798\,r^3 + .03669\,r^2 + r - .18455 = 0,$$
r = .1824.
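The coefficients of the quartics in Tables I. to V. can be generated from h and k alone. The sketch below assumes that they are the leading terms of the usual tetrachoric expansion, namely hk/2 for r², (h² − 1)(k² − 1)/6 for r³ and hk(h² − 3)(k² − 3)/24 for r⁴; the function name is illustrative.

```python
def series_coefficients(h, k):
    """Return the coefficients of r^2, r^3 and r^4 in the series for r."""
    c2 = h * k / 2
    c3 = (h * h - 1) * (k * k - 1) / 6
    c4 = h * k * (h * h - 3) * (k * k - 3) / 24
    return c2, c3, c4


# h and k as printed for Table V. (total material) and Table IV.
v_c2, v_c3, v_c4 = series_coefficients(.19705, .37238)
iv_c2, iv_c3, iv_c4 = series_coefficients(.24264, .26275)

print(f"Table V.:  {v_c4:.5f} r^4 + {v_c3:.5f} r^3 + {v_c2:.5f} r^2 + r")
print(f"Table IV.: {iv_c4:.5f} r^4 + {iv_c3:.5f} r^3 + {iv_c2:.5f} r^2 + r")
```

Under this assumption the computed coefficients should agree with those printed beneath Tables IV. and V. to about the fourth decimal place; the constant term of each quartic is the quantity (ad − bc)/(N²HK) formed from the corresponding fourfold table.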














